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ABSTRACT 

Distributed  database  management  systems  ( DDBMS )  are  amongst 
the  most  important  and  successful  software  developments  in  this 
decade.  They  are  enabling  the  computing  power  and  data  to  be 
placed  within  the  user  environment  close  to  the  point  of  user 
activities.  The  performance  efficiency  of  DDBMS  is  deeply  related 
to  the  query  processing  strategies  involving  data  transmission 
over  different  nodes  through  the  network.  This  thesis  is  to  study 
the  optimization  of  query  processing  strategies  in  a  distributed 
databases  environment. 

with  the  objective  of  minimum  communication  cost,  we  have 
developed  a  mathematical  model  to  find  a  join-semi join  program 
for  processing  a  given  equi-join  query  in  distributed 
homogenenous  relational  databases.  Rules  for  estimating  the  size 
of  the  derived  relation  is  proposed.  The  distributed  query 
processing  problem  is  formulated  as  dynamic  network  problem.  We 
also  extend  this  model  to  consider  both  communication  cost  and 
local  processing  cost.-\  For  a  simpler  case  where  all  semi  join 
reducibilities  are  zero\and  semi join  reducibilities  do  not  change 
after  join  operation,  \  we  have  shown  that  under  three  different 
objective  functions,  the  problems  of  finding  a  routing  strategy 
of  required  data  to ythe  site  where  a  query  is  initiated  are 
NP-complete.  We  analyze  the  difficult  nature  of  the  query 
processing  prob^am^and  provide  an  analytical  basis  for  heuristic 
aj.qnr  i  thms-r — 

"■'We  extend  this  model  to  query  processing  in  a  distributed 
heterogeneous  databases  environment.  A  heterogeneous  database 
communication  system  is  proposed  to  integrate  heterogeneous 
database  management  systems  to  combine  and  share  information. 
The  use  of  a  database  communication  system  for  heterogeneous 
DBMSs  makes  the  overall  system  transparent  to  users  from  an 
operational  point  of  view.  Problems  of  schema  translation  and 
query  translation  of  the  query  processing  in  this  environment  are 
studied. 
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Chapter  1 
Introduction 

1.1  Background  and  Motivation 

Database  management  systems  are  amongst  the  most 
important  and  successful  software  developments  in  this 
decade.  They  have  already  had  a  significant  impact  in  the 
field  of  data  processing  and  information  retrieval.  Many 
organizations  have  successfully  developed  their  own  database 
management  systems  for  storing  and  accessing  information  to 
help  their  operations.  The  increaing  geographic  dispersion 
of  the  business  activities  within  an  organization  forces  the 
enterprise  to  develop  a  distibuted  database  management 
system  (DDBMS)  in  order  to  provide  its  users  faster  and 
easier  access  to  data  for  decision  making  as  well  as  to  keep 
the  system  reliable  and  secure.  For  example,  in  militay 
Command,  Control  and  Communication  (  C^  )  systems,  data 
gathering  from  sensors  and  commanders  are  distributed  in 
nature.  A  centralized  database  management  system  can  not 
provide  the  availability,  reliability  and  modularity  that  is 
needed. 

The  development  of  computer  networks  since  1970,  and 
the  emergence  of  low-cost,  yet  powerful,  small  mini  and 
micro  computers  makes  it  possible  to  develop  a  distributed 
database  management  system  enabling  the  computing  power  and 
data  to  be  placed  within  the  user  environment  close  to  the 
point  of  user  activities.  The  development  of  DDBMS  are 
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apparently  the  result  of  the  convergence  of  these  different 
technologies.  It  certainly  will  make  possible  both 
economical  and  technical  advantages  such  as  faster  access  to 
data,  better  performance,  increased  reliability,  easier 
upward  expansion,  and  more  information  sharing,  etc. 

However,  the  benefits  of  ODBMS  can  not  be  gained 
without  cost.  Several  problems  inherent  to  distributed 
systems  must  be  solved  as  must  other  problems  related  to 
database  systems  in  general.  These  additional  distributed 
system  problems  come  from  the  slow  transmission  speed, 
narrow  bandwidth  and  possibly  high  failure  rate  of  the 
communication  channels.  The  speed  of  communication 
transmissions  are  slow  compared  to  the  CPU,  main  memory  and 
secondary  storage  times.  These  distributed  system  problems 
include  concurrency  control,  recovery,  database  integrity, 
query  processing,  directory  management,  security,  etc. 

Recent  studies  [ESW  78,  GBWRR  81,  HY  79,  DL  80,  DP  80] 
on  distributed  databases  have  shown  that  the  performance 
efficiency  of  DDBMS  is  deeply  related  to  the  query 
processing  strategies  involving  data  transmission  over 
different  nodes  through  the  network.  Moreover,  the 
communication  technologies  have  not  yet  reached  the  same 
level  as  the  computer  technologies,  both  in  cost  and 
^,,.,'or  .nee,  and  it  is  expected  that  this  situation  will  not 
be  changed  in  the  near  future.  So,  in  this  thesis  we  study 
the  optimization  of  query  processing  strategies  in  a 
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distributed  databases  environment. 

1.2  Assumptions  and  Objectives 

Query  Processing  in  a  distributed  database  environment 
corresponds  to  the  translation  of  a  request  formulated  in  a 
high-level  nonprocedural  language  on  one  sub-system  of  the 
network,  into  a  sequence  of  data  manipulation  statements 
which  retrieve  and  update  data  stored  in  local  DBMSs.  The 
objective  of  this  thesis  is  to  develop  a  quantitative  and 
syntatic  understanding  of  the  optimization  of  query 
processing  strategies  in  a  distributed  database  management 
systems  environment.  Particular  emphasis  will  be  given  to 
the  minimization  of  the  total  amount  of  data  transmission 
cost  required  for  processing  a  single  query. 

This  thesis  starts  with  a  mathematical  model  for 
equi-join  query  processing  in  distributed  relational 
database  management  systems.  It  then  studies  the 
computational  complexity  of  this  problem  and  some  solution 
algorithms.  The  query  processing  in  a  distributed 
heterogeneous  database  environment  is  also  studied. 

Our  basic  underlying  assumptions  in  this  thesis  are: 

1.  It  is  possible  to  exchange  information  amongst  the 
various  systems  and  they  are  willing  to  maintain 
information. 

2.  Each  DBMS  i;  considered  to  be  able  to  execute  a  given 
local  transaction. 
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3.  There  exists  a  communication  network  which  connects  the 
various  DBMSs. 

4.  The  access  to  a  local  DBMS  is  not  affected  by  the 
operation  of  the  data  communication  system  which  should 
be  transparent  to  the  local  user. 

5.  The  communication  cost  is  the  dominant  factor  and  local 
processing  is  essentially  free. 

As  we  are  only  concerned  with  distributed  query 
processing  in  this  thesis,  we  also  assume  that  other 
problems  related  to  distributed  database  management  systems, 
such  as  concurrency  control,  database  integrity,  redundancy, 
recovery  and  security  poblems  have  been  taken  care  of  by 
sow.*  other  components  of  the  system. 

1.3  Organization  of  The  Thesis 

This  thesis  is  organized  into  seven  chapters. 

Chapter  1  contains  the  background  and  motivation  of 
this  thesis.  Query  processing  in  distributed  databases  is 
the  main  theme  of  this  thesis.  Assumptions  about 
distributed  database  environments  are  stated  and  objectives 
of  this  thesis  are  defined. 

Chapter  2  discusses  query  processing  in  a  distributed 
relational  database  management  system  environment.  As 
communication  technologies  have  not  yet  reached  the  same 
level  of  reduction  in  costs  and  of  increases  in  performance 
that  we  can  observe  in  computer  technologies,  we  assume 
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network  traffic  constitutes  the  critical  factor.  We 
formalize  the  problem  of  solving  an  equi-join  query  by  joins 
and  semijoins  mixed  strategies  into  a  mathematical  model. 
Query  processing  for  a  broader  class  of  query  is  also 
discussed. 

Chapter  3  considers  the  computational  complexity  of  the 
query  processing  problem  formulated  in  chapter  2.  Three 
variations  of  simpler  cases  of  the  query  processing  problem 
with  different  objective  functions  are  studied. 

Chapter  4  focuses  on  heuristic  algorithms  for  the  query 
processing  problem.  We  first  consider  the  heuristic 
algorithms  for  the  cases  where  all  possible  semijoins  are 
performed  and  find  a  routing  strategy  for  the  join 
operations  to  solve  the  query.  Next,  we  consider  heuristic 
algorithms  for  the  general  case  where  we  want  to  solve  a 
given  query  by  a  sequence  of  joins  and  semijoins. 

Chapter  5  extends  the  previous  results  on  distributed 
relational  databases  to  the  query  processing  in  distributed 
heterogeneous  databases.  It  begins  with  a  study  of  the 
heterogeneous  world  of  database  management  systems.  The 
architecture  of  a  heterogeneous  database  communication 
system  is  described  to  integrate  those  heterogeneous, 
distributed  and  nonintegrated  database  management  systems. 
Query  processing  in  heterogeneous  database  environments  by 
using  a  database  communication  system  is  discussed. 


Chapter  6  deals  with  two  essential  components  o£  a 
database  communication  system  for  query  processing:  schema 
translators  to  translate  from  the  data  model  schema  of  the 
database  management  system  to  relational  schema  in  order  to 
provide  a  uniform  relational  view  to  the  user;  and  query 
translators  to  translate  relational  algebra  operations  into 
corresponding  data  manipulation  statements  of  the  underlying 
data  model  of  the  system  in  order  to  retrieve  data. 

Chapter  7  provides  a  summary  of  the  major  results  of 
each  chapter  and  suggests  several  potential  areas  for  future 
research. 
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Chapter  2 

Query  Processing  in  Distributed  Relational  Databases 

2.1  Introduction 

Query  processing  in  a  Distributed  Relational  Data  Base 
Management  System  (DDBMS)  corresponds  to  the  translation  of 
requests,  formulated  in  a  high-level  language  on  one  system 
of  the  network,  into  a  sequence  of  relational  algebra 
operations  which  retrieve  and  update  data  stored  in  the 
DDBMS.  The  optimization  process  is  usually  subdivided  into 
two  parts:  The  first  part  is  used  at  the  node  where  the 
query  is  generated  for  analyzing  the  query  before  executing 
it  and  producing  as  output  a  sequence  of  operational 
commands  to  the  local  DBMS,  which  is  optimal  in  terms  of 
some  cost  function  criteria.  The  second  part  is  used  by  the 
local  DBMSs  for  further  analyzing  each  operational  command 
and  producing  the  optimal  local  data  retrieval  strategy.  The 
result  of  these  operations  constitutes  the  final  response 
returned  to  the  user. 

In  distributed  query  processing,  the  execution  of  a 
query  involves  data  transmissions  which  take  significant 
time  in  comparison  with  the  subquery  and  elementary 
operation  execution  times.  However,  on  the  other  hand,  the 
distribution  of  the  system  makes  possible  the  parallel 
processing  of  local  elementary  operations,  which  is 
beneficial.  The  cost  function  of  processing  a  query  in  a 
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DDBMS  environment  depends  on  the  following  parameters: 

1.  Total  transmission  time. 

2.  Total  execution  time. 

3.  Total  resources  usage. 

4.  Total  response  time. 

Some  of  the  parameters  are  dependent  on  others, 
therefore  the  cost  function  will  depend  on  all  four 
parameters  and  may  not  be  linear.  In  a  distributed  DBMS, 
the  main  bottle-necks  of  the  system  are  the  transmission 
delays  of  the  inter-computer  communications.  The  cost 
function  may  be  simplified  by  considering  only  the  quantity 
of  data  transmitted  through  the  network.  Most  of  the  query 
processing  algorithms  proposed  to  date  concern  only  the 
reduction  of  the  total  transmission  time. 

In  this  chapter,  we  first  formalize  some  definitions 
for  distributed  relational  DBMS  and  derive  some  basic 
results  in  section  2.2.  In  section  2.3,  we  review  the 
literature  of  distributed  query  processing  algorithms.  We 
extend  previous  work  in  two  directions:  One  is  that  we  want 
to  solve  a  distributed  query  by  using  a  join  and  semijoin 
mixed  strategy.  The  other  is  that  we  want  to  formalize  the 
problem  in  a  mathematical  formulation.  In  section  2.4,  we 
develop  a  distributed  query  processing  model  for  a  class  of 
conjunctive  equi-join  queries  and  formulate  the  problem  as  a 
dynamic  network  problem.  In  section  2.5,  we  extend  the 
model  to  general  query  processing  where  inequality- join 
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clauses  could  appear.  The  model  can  also  be  extended  to 
include  the  local  processing  cost  into  consideration. 

2.2  Relational  Terminology  and  Basic  Lemmas 

In  this  section,  we  formalize  the  time-invariant 
description  of  a  relation  and  a  database. 

2.2.1  Schema 

A  domain  is  a  set  of  values.  A  relation  is  any  subset 

of  the  cartesian  product  of  one  or  more  domains.  The  members 

of  a  relation  are  called  tuples.  A  tuple  t»(t(  ,tx, . . . ,tn) 

has  n  components;  the  i-th  component  of  t  is  t^  .  We  can 

view  a  relation  as  a  table,  where  each  row  is  a  tuple  and 

each  column  corresponds  to  one  component.  The  columns  are 

often  given  names,  called  attributes.  Let  U  be  a  set  of 

attributes,  called  a  Universe.  Associated  with  each 

attribute  A€U  is  a  domain,  DOM ( A ) .  A  relation  schema  R  is  a 

list  of  attribute  names  for  a  relation,  i.e.  a  subset  of  U. 

Let  R^«{  A,  ,AX  ,...,Am)  be  a  relation  schema.  We  will  use 

the  notation  R;.AKto  denote  the  attribute  AK  of  relation  R^ 

1^) 

.  The  domain  of  R:  is  DOM(R-  )*  .X  DOM(R..A;  ).  Let  R.,  Ra 

*  *  ^  j 

Rn  be  relation  schemas.  A  database  schema  D  is 

defined  as  a  set  of  relation  schemas  {  R, ,  Rif...f  R«}.  U(D) 

n 

is  the  attribute  set  of  D,  i.e.  U(D)>  U  R • .  The  domain  of  D 

4»l 

is  denoted  DOM(D)  and  is  defined  to  be  X  DOM ( R • ) .  Let  D. , 

4«» 

Da,...,Dp  be  a  set  of  database  schemas.  Bach  Dk  corresponds 
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to  a  database  at  one  location.  A  distributed  database  schema 
DD  is  defined  as  a  set  of  database  schemas  {D, ,0^ , . . . ,D^  } 
and  U(DD)  is  the  set  of  attributes  in  DD. 


When  set  notation  is  used  (e.g.  R-  U  R*  ,  Av€  R • ,  R.  c  R. 

) ,  the  sets  in  question  are  understood  to  be  sets  of 

attributes.  R:  ■  R-.  iff  R.  £  R;  and  R:c  R • ,  i.e.  R*  and  Rv 

A  f  *<  X  i  *  A  « 

are  two  relation  schemas  having  the  same  set  of  attributes. 
Two  database  schemas.  D  «{R(  ,Ra , . . .  ,R^}  and  D  »{S|  ,Sa  . . . .  ,S^ 
},  are  equivalent  if  there  exists  a  one-to-one 
correspondence  between  their  relations  with  respect  to  the 
equality  of  two  relation  schemas,  i.e.  there  exists  a 


n-permutation  o' such  that  R^-Sg-^j 


for  all  i*l,2 


n. 


Definition: 

We  define  as  follows  the  size  of  a  database  schema: 

The  size  of  a  distributed  database  schema  DD,  IDDI  , 

/lrL 

is  the  number  of  different  sites  in  DD. 


The  relation  size  of  a  database  schema  D* ,  |D^|^  ,  is 

the  number  of  relation  schemas  in  D  . 


The  relation  size  of  a  distributed  database  schema  DD, 

| DD |  ,  is  equal  to 
WPPln. 

%-  i^v 

/€.*!  r 


The  attribute  size  of  a  relation  schema  R* ,  |  R  •  |  ,  is 
the  number  of  attributes  in  R; . 
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The  attribute  size  of  a  database  schema  D  ,  |  D  | ,  is 

n  A 

equal  to  |U(D)|,  where  U(D)-U  U(RX) . 


DEFINITION: 


D.is  a  database  subschema  of  D;  ,  denoted  by  D.$D:r  if 

*  i  *  » 

for  all  D*,  there  exists  R.€  D.  such  that  R  £  R 
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R„€  D:  is  maximal  if  there  exists  no  R. €  D.  such  that 
K  *  JL  At 

. 

DEFINITION: 

For  each  D.  #  A€U(D^),  we  define  s(A,D<i)»{R|e|  (R^D^  ) 

and  (a€rk)'|,  the  set  of  all  relation  schemas  in  D^  for 
which  A  is  an  attribute. 

For  A€U(DD),  we  define  s(A)-{D^  |  (D^  6  DD)  and  (|s(A,Dx 

mm 

DEFINITION: 

Attribute  A  is  said  to  be  isolated  in  DD  if  |s(A)|*l. 
i.e.  if  A  is  an  attribute  of  only  one  D^. 

DEFINITION: 

A  schema  graph  for  a  database  schema  D»{R,  ,R2  ,...,Ra] 
is  a  graph  Gp»<Vp,Ep>  with  node  set  Vp-D  and  edge(Rx»Rj 
KE^  iff  O(R^)  AO(Rj)^. 

A  schema  graph  for  distributed  database  schema  DD«{D,  ,DX 
, Dp}  is  a  graph  GPP-<VPP'EPD>  with  node  set  Vpp«DD 
and  edge  (D^DjteEppif  f  0(0^ )  AU(Dj )*f. 
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2.2.2  Database  state 

Let  R-{A, , Aj, . . ,A„J  be  a  relation  schema.  A  tuple  T 
over  R  is  a  mapping  which  associates  with  each  attribute  a 
value  out  of  a  distinct  domain  associated  with  the 
attribute.  When  the  order  of  attributes  in  R  is  fixed,  we 
may  write  T  as  an  m-tuple  T-(T(A, ) ,T(Aa) , . . . ^(A^  ) . 

Definition: 

1.  A  relation  state  r  of  R  is  a  finite  set  of  tuples  over 
R.  r  may  be  visualized  as  a  table  of  data  whose 
columns  are  labeled  A.,  A,,...,  A  and  whose  rows  are 

•  •  -frv 

tuples. 

2.  If  D«{R, ,Ra, . . . ,  R^} ,  then  a  database  state  for  D  is 
d«{r,  , ra  , ...,  r^}  where  for  l£i$n,  r^  is  a  relation 
state  for  R^  ;  we  denote  the  database  state  by  d. 

3.  If  DD"{D, ,Da , . . .0^ }  then  a  distributed  database  state 
for  DD  is  dd«{d, ,da , . . .  ,dp  }  where  for  I3k$m,  d^  is  a 
database  state  for  D^  ,  we  denote  the  distributed 
database  state  by  dd. 

Note  that,  if  there  is  no  confusion,  we  will  use 
capital  letters  (e.g.  R,  D,  DD)  to  represent  both  schema  and 
state  through  out  the  thesis.  For  example,  relation  R»(S,r) 
is  to  represent  both  its  relation  schema  S  and  relation 
state  r. 

Example:  Relations  and  Databases 
(1)  Three  relation  schemas: 
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CLIENT* { CNUMBER ,  NAME,  ADDR,  AGE,  BIRTHDA?} 

AGENT*  {ANUMBER,  NAME,  ADDR,  TERRITORY,  SENIORITY} 

SALE-  {ANUMBER,  CNUMBER,  POLICYNO,  DATE} 

(2}  Database  schema: 

D  *  {  CLIENT,  AGENT,  SALE  ) 

(3)  Set  o£  attributes: 

U( CLIENT) -{CNUMBER,  NAME,  ADDR,  AGE,  BIRTHDAY} 

U(D)- {CNUMBER,  NAME,  ADDR,  AGE,  BIRTHDAY,  ANUMBER, 
TERRITORY,  SENIORITY,  POLICYNO,  DATE} 

(4)  Relation  states  and  database  state: 

CLIENT 

|  CNUMBER  |  NAME  |  ADDR  |  AGE  |  BIRTHDAY  | 

|  11  |  TOTO  |  MAIN  ST.  |  32  |  12/18  | 

|  12  |  YOYO  |  WEST  ST.  |  41  |  3/14  | 

|  13  |  LILI  |  EAST  ST.  |  26  |  6/30  | 

|  14  |  PAPA  |  NORTH  ST.  |  29  |  5/11  | 

|  15  |  DADA  |  MAIN  ST.  |  30  |  7/31  | 

I  16  |  QUQU  j  EAST  ST.  J  37  |  9/17  | 

AGENT 

|  ANUMBER  |  NAME  |  ADDR  |  TERRITORY  |  SENIORITY  | 

|  A1  |  JOHN  |  MAIN  ST.  |  CENTER  |  10  ( 

|  A2  |  TOM  |  WEST  ST.  |  WEST  |  8  | 

|  A3  |  PAUL  |  NORTH  ST.  |  NORTH  |  5  | 

|  A4  |  DICK  |  SOUTH  ST.  |  SOUTH  |  3  | 

|  A5  |  JEFF  |  EAST  ST.  |  EAST  |  11  | 


|  ANUMBER  |  C NUMBER  |  P0LIC7N0  | 


DATE 


A1 

i  U 

1 

3 

1  4/78 

A2 

1  12 

1 

4 

|  5/SC 

A5 

1  13 

1 

7 

|  9/80 

A3 

1  14 

1 

1 

1  9/81 

A1 

1  15 

1 

4 

|  12/81 

A5 

1  15 

1 

1 

|  12/81 

2.2.3  Operator 

The  operators  we  use  are  a  subset  of  the  relational 
algebra  [CODD  72]. 

Definition: 

Let  R»{A,  ,KX, . . .  fAjyJ ,  and  let  r  be  the  relation  state 
of  R  and  t«{t,  ,tz  ,  ...,tm)  be  a  tuple  in  r.  The 
projection  of  tuple  t  on  attributes  7-JA^,  rA^  , . . . , A^m 
}C  R  is  t [ Y]  —  < t^,  ,tia  , ...rt^m  ).  That  is,  the 
projection  is  obtained  by  removing  from  t  those 
components  corresponding  to  attributes  not  in  7. 

The  projection  of  relation  state  r  onto  an  attribute 
set  7  is  r[7]-{s'|  (s'»s[7])  and  (s  is  a  tuple  of  r)). 
Also  denoted  as  TTY[r]* 
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Definition: 

Let  R,a(S,  ,r, )  and  Ra»(Sa  ,rx)  be  two  relations  with 
the  sane  relation  schemas,  i.e.  S  ,  »s2  .  The 

intersection  of  these  two  relations,  denoted  by  R (  ft  r2 
,  is  the  relation  (S,r),  where  S  -S,  -S2  is  the 
relation  schema  and  r«{t|t€r,  A  rx  }  is  the  relation 
state. 

Definition: 

Let  R  and  S  be  relations  of  arity  (number  of 
attributes)  h  and  k,  respectively.  The  Cartesian 
product  of  R  and  S,  denoted  by  R  x  S  is  the  set  of 
(h+k)~tuples  whose  first  h  components  form  a  tuple  in 
R  and  whose  last  k  components  form  a  tuple  in  S. 

Definition: 

« 

Let  {R-  }.  be  a  set  of  relations  and  the  schema  of 

At  l  •! 

relation  R^  be  the  set  of  attributes  {A^,  ,A ^  .  »A£|#. 

}.  A  qualification  q  is  a  formula  of  clauses  of  the 
forms  (R^  .A^e  6  Rj  .A^  )  and  (R^  ,Ait  0  C  ),  where 
0  is  an  arithmetic  operator  (  <,  «,  >,  <■ ,  >•  or  ■  ) 

and  C  is  a  constant  value  in  the  domain  of  the 
attribute  A^e  of  R^  ,  connected  by  logical  operators  (A 
,  V,  and  n  ). 

Definition: 

The  selection  of  relation  R  on  qualification  q. 
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denoted  fl^(R)-  , t2  , . . .  f  trt)£  R|  q  (t,  , ta, . . .  ,tn) 

is  true],  i.e.  0~g  (R)  is  the  set  of  tuples  in  R  that 
satisfy  q. 


Let  R  . 

A, 


and  R 


be  two  relations 


Definition: 


and  Y  C  R .  f\  R  . 

A  S 


The  equi-join  of  R^  and  R  •  on  Y,  denoted  by  R^  |X| 

,  is  the  resulting  relation  of  computing  {t|t  is  a 
tuple  in  R^x  Rj,  ,  such  that  ttR^.Y]  ■  t[R*  ,Y]  J  and 
then  projecting  out  columns  Rj.Y. 

The  semi  join  of  R:and  R- on  Y,  denoted  by  R  •  IX  R .  , 

t  *  V  J 

equals  R^  |X|  Rj[Y],  Equivalently  it  equals  {t^|  (t^ 

6r.)  and  (3  tj€rj  *  tJYJ-tj  [Y] )] . 


The  natural  join  of  R^  t  Rj  ,  denoted  by  R^.  |X|  Rj  , 

is  the  join  of  R?  &  R;on  R.n  R;  . 

*  j  *  J 

The  natural  semi  join  of  R-i  Rj  ,  denoted  by  R^.  |X  R  •  , 

is  the  semi  join  of  R  •  fc  R.  on  R  •  f\  R:  . 

*  J  A  J 


Note  that  the  natural  join  is  commutative-  i.e. 
R^  |X|  Rj  -  Rj  |X|  R^  . 


Definition: 

The  natural  join  of  R  *  - ( S; ,  r  • ) ,  R  •  » 

-(SK,rK)r  denoted  by  Rx  |X|Rj  |X|RR  , 

relation  of  (R*  |X|R>)  |X|  R*  . 

Yti  Y^Tj*  * 

and  Y.  U  Y,  -(S.  U  SOrtS*  . 
a*  J|c  a.  ;  K 


(S.  ,rj  )  and  R^ 

is  the  resulting 

Here  Y  ••  »S  .  A  S- 

J  A  J 


Ji 
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Lemma:  For  natural  joins,  we  have 


Proof: 


(R.'XI  Rj)  HI  |X|  (  »J  |I| 

Yv  YjkUYjk  'lijL/tx,  Y, 


I ) 


K  '* 


1.  The  relation  schemas  of  both  resulting  relations 
above  are  the  same,  which  is  S  .  U  S-  U  S*  . 

2.  Let  r,  be  the  state  of  (R.  |X|  R.  )  \x\  R  ,  r, 

w  i  Yt^Yjw.  *  * 

be  the  state  of  R.  |X|  (  R  |X|  R  K  )  and  r.  be 

v  YijuY*  j  'Y;k  *  a 

the  state  of  R  .  |X|  R.  .  We  can  derive 


\  *  {  1 1  3  tjj €  tjj  6  t0  e  R^  |  X  |  R  •  S.t . 

J 

fc[RK  '  fctRX  0  Ri  ]“*.  1 


■  C  t|3tK4rK,  t^erj,  tj  6  r .  i  t06  RJXIR^  s.t. 
t[RK]-tK,  t [R^  ]*t,[R^  J-t^.  and 
t[Rj  ]-t#[R  .  ]-tp 


i  .e. 


■  {  t|3tK€rK,  r^  ,  tj€  r^  s.t. 

MRnl-t*,  t[R^]»t^  and  ( 

t [R j  ]-tj  } 

Similary,  we  can  derive  v z  -  the  formula  (*).  Thus  r, 
•r^  ,  and  the  lemma  follows. 

Note  that  the  natural  join  operation  is  associative, 

<rA,  1*1  Rj)  1*1  R,c  *  R^  1*1  (  Rj  1*1  Rk). 


Example : 


I  7 

±L 

Aa  1 

A3  1 

l*i 

*  m 

i 

A3 

_l_ 

^4 

1  l 

1 

11 

1 

21  | 

31  | 

1  21 

1 

31 

1 

41 

1 

1 

12 

22  | 

31  1 

|  22 

32 

42 

1 

i  13 


I  14 


H,  |X|  R* 

{A3} 


R,  | X 


A. 

Aa 

1 

A3 

AA 

> 

11 

21 

1 

31 

21 

41 

12 

22 

1 

31 

21 

41 

13 

21 

1 

32 

22 

42 

14 

22 

1 

32 

22 

42 

A,  I  AX  |  Aj  |  A+  | 


11  |  21  |  31  |  41  | 


14  |  22  |  32  |  42  | 


This  is  the  natural  join  since  {Aj,  Aj}*R,  f\  Ra , 


(2)  Semijoin  R,Z|  R2 
Y 
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R.XI  R»  ia  to  join  - 

{A3}  MM 

MM 

I  32  j 


with  R. 


I  21_j_31_|_41^| 
|  22  |  32  |  42  | 


R  i  X |  Ra  is  to  join - 

{A»'A3*  L±*-Lt*J- 

LJLLILI 

1  22  1_31_| 
I  21_l_32_| 
I  22  |  32  | 


with  Ra.  ■ 


I  21  1  31  MM 
|  22  I  32  I  42  I 


This  is  a  natural  semi  join  since  {Aa,Aj  j«R<  {%  ^ 
(3)  R,  |X|  R2  |Z|  Rj 


*2. 

31 


31 

32 
32 


A4  I  A5  I 

■  •  w  «  •  •*  — 

41  |  51  | 

43  |  53  | 

44  |  52  | 
44  |  51  | 


Then  R,  |X|  R2  |X|  R3 


(  R,  1*1  R x  )  1*1  4  *3 

{  A^ » Aj  }  (Aj^j 

I  11  I  21  I  31  MM  51  I 


2.2.4  Query 

Given  a  database  D-{R, ,Ra, . . . ,  Rnh  a  query  can  usually 
be  written  in  a  number  of  alternative  algebraic  expressions. 
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In  particular  each  query  can  be  put  in  the  following  form: 

Q  *  "71^  OJOtjXR^x...  xRn) 

where  TL  contains  the  attributes  in  the  answer  relation;  q 
is  a  qualification  and  each  R^  is  a  relation.  Usually,  TL  is 
referred  to  as  the  target-list.  We  shall  assume  all 
queries  are  expressed  in  this  canonical  form,  denoted  by 
Q-(q,  TL).  A  query  Q  is  called  single  relational  (SR)  if  TL 
is  a  subset  of  some  relation  schema  in  0,  otherwise,  Q  is 
multi relational  (MR). 

Definition: 

A  query  Q-(q,  TL)  is  a  conjunctive  equi-join  query  if 
the  qualification  q  is  a  conjunction  of  equi-join 
clauses  of  the  form  (R^.  X«  R^.Y),  where  X  and  Y  are 
subsets  of  attributes  of  R^  and  Rj  respectively. 

Note  that  equi-join  queries  do  not  include  one-relation 
clauses  of  the  form  (R^.Y  -constant).  One-relation  clauses 
were  excluded  because  they  correspond  to  local  operations 
and  are  generally  evaluated  locally  before  join  and  semijoin 
operations  are  applied. 

Let  DD-{R| ,  Rj,.*.,  Rpl  be  a  distributed  database 
schema  and  Q-(q,TL)  be  a  query  over  DD.  For  every 
distributed  database  state  dd,  the  answer  of  Q  over  (DD,dd), 
denoted  by  Q(dd),  is  Q(dd)-  TT-fl.  0$  (r,  xr2  x. . .  xrp). 


Definition:  (equivalence) 

Given  a  database  schema  DD,  two  queries  Q,  and  Qa  are 
equivalent,  denoted  Q(  3  Qa  ,  iff  for  all  database 
states  dd,  Q  ((dd)>Q^(dd) . 

Definition: 

Let  Q«(q,TL)  be  a  query.  The  transitive  closure  of  Q 
is  a  query  Q4  ■  (q4  ,TL)  whose  qualification  q4, 

includes  q  and  all  clauses  implied  by  q  under 
transitivity,  (e.g.  If  (R,  .A( -R^.A^)  and  (R^.A^-R^.Aj ) 
are  in  q,  then  (Rt .A,  -R^.Aj)  is  in  q4). 

Lemma: 

Given  a  database  schema,  a  query  Q  is  equivalent  to 
its  closure  Q4.  i.e.  Q  3  Q4  . 

Proof : 

Let  Q«(q,TL)  and  Q4»(q4,  TL).  By  the  definition  of  q4, 
for  each  database  state  dd»{r,  ,r2,.*.,rn),  any  tuple  t 
in  r(  xra  x...xr„  that  satisfies  q  satisfies  every 
clause  in  q.  Since  every  clause  in  q4  is  either  a 
clause  in  q  or  a  clause  derived  from  clauses  in  q  by 
transitivity,  the  tuple  t  also  satisfies  every  clause 
in  q4  .  The  other  direction  is  also  true.  So  * 

r^  x  ...  x  rM  )  »  fl^.(r,  x  t3x  ...  x  rn).  Also  the 

target  list  of  Q  and  Q  are  the  same.  This  implies 
Q(dd)«Q4(dd).  Thus  Q  -  Q4  . 

Definition: 

A  qualification  q  is  called  sub-natural  iff  for  each 
clause  •  Aji. , 
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A  qualification  q  is  called  natural  iff  q  is  a 

subnatural  qualification  and  for  all  relation  schemas 

R*  and  R-,  and  for  all  A.€  R.  A  R;,R:  .A*  *R*  is  a 
^  g  It  a  t  *  g  it 

clause  of  q. 

Definition: 

Given  a  database  schema  D«{R,  ,R2  ,  a  query 

Q»(q,TL)is  called  a  natural  join  query  (NJQ)  (resp. 
sub-natural  join  query  ,  SNJQ)  iff  q  is  a  natural 
qualification  (resp.  sub-natural  qualification)  for  it 
and  TL£U(D). 

2.2.5  Qual  Graph 

Next  we  define  qual  graphs  for  the  class  of  sub-natural 
join  (SNJ)  qualifications.  Qual  graphs  are  another  structure 
for  specifying  sub-natural  queries.  Let  D  be  a  database 

-TV. 

schema  and  let  U(D)  »  U  R • . 

A*l  * 

Definition: 

A  qual  graph  for  Qa(qrTL)  over  D  is  an  edge~labelled 
undirected  graph  Ge  -<VA,Ea,L#>,  where  Vecontains  one 

V<P> 

node  per  relation  schema  in  D,  xVft  ,  and  LQ:Efl->  2 

,  where  2 UCP*  is  the  set  of  all  subsets  of  U(D),  with 

L  [<R*,Rj,>]« 

Note  that: 

1.  The  qual  graph  of  Q»(q,TL)  is  uniquely  defined  by 
the  qualification  q. 

2.  The  label  of  edge  (Rj,Rj,)r  (R^  »R^) }  may  be  the 
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empty  set 


Definition: 


The  transitive  closure  of  ,  Eft,  Lft>,  denoted  by 

0><  vj,  el,  1 1>  i*  v„,  e*.  E0,  and 


a 


a 


a 


LftHRi'R> ))“{  O(D)  |  there  is  a  path  from  R^  to  R- 

«  K  f 

in  G  |  such  that  A^is  in  the  label  of  each  edge  on  the 
path  }. 


Lemma  1: 

Let  Q«(q,TL)  be  a  sub-natural  join  graph  and  its  qual 
graph  be  Ga-<V&,Eft,La>  ,  then  Gj  »  Gg*.. 

Lemma  2: 

Let  Q»(q,TL)  be  a  natural  join  query  then  Gfl  «  g£  -G^ 


Lemma  3: 

Let  Q,*(q ,,  TL, )  and  Qa"(q2,TLa)  be  sub-natural  join 
queries.  Q,  s  Qa  iff  Ggf»  g£x  and  TL(»TLa. 

In  the  sequel  we  will  only  consider  sub-natural  join 
queries  whose  qual  graphs  are  connected.  A  query  whose  qual 
graph  is  disconnected  produces  a  result  that  is  the 
Cartesian  product  of  database  substates  produced  by  each 
connected  component.  Since  these  connected  components  are 
not  joined  in  any  way,  there  is  no  loss  of  generality  in 
treating  the  components  seperately. 
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Definition: 

A  sub-natural  join  query  Q«(q,TL)  is  called  a  tree 
query  iff  there  exists  a  connected  qual  graph  G&'  such 
that  Q'  s  Q  and  G^  is  a  tree.  All  other  sub-natural 
join  queries  with  connected  qual  graphs  which  do  not 
have  equivalent  queries  with  tree  qual  graphs  are 
called  cyclic  queries. 


Example: 

Let  Rj »{A, C} #  Ra-{A,D},  Rj-{A,B)  and  R4«{B,D,E}. 

1.  Qj-tq^TL),  where  qt*  (R,  .A  *  Ra . A)  A  (R^.A  ■  Rj.A) 

(R I . A  -  R3.A)  A  (Rj-B  -  R^.B) 
Q,  is  a  subnatural  join  query. 

The  qual  graph  GQ  of  Q,  is  not  a  tree. 


R^.A) 


(Rj.A  -  R4.A) 
The  qual  graph  Gg  of  is  a  tree. 

A\  A 


V 


B 


4 
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Thus,  Q2  is  a  tree  query  because  »  Gg 
(i.e.  Q  ■  Q,  ),  and  Gfi  is  a  tree, 
we  see  that  Q(  is  also  a  tree  query. 

2.  Q3-(qj,TL),  where  q  *  (R( .A  ■  R*.A)  A  (R^.A  -  R3.A)A 

(Rj  >0  m  R^.  0)  A  (R^.p  ■  R^  »p ) 


From  6^  we  can  see  that  all  queries  equivalent  to 
have  cyclic  qual  graphs.  Thus,  is  a  cyclic 
query. 

Note  that,  for  this  thesis,  we  will  consider  an 
operation  or  a  query  to  be  a  mapping  from  a  temporary 
database  state  to  a  new  temporary  database  state.  During  the 
analysis  of  a  query  solution  strategy,  the  database  state  is 


conceptually  considered  changing  from  one  state  to  the  other 
state  by  an  operation.  The  real  database  state  stored  in  the 
system  does  not  change  unless  update  operations  are  really 
performed. 

2.3  Review  of  Previous  Works 
2.3.1.  [WONG  77]  fc  [ESW  78] 

Wong's  algorithm  is  the  first  comprehensive  solution  to 
the  distributed  query  processing  problem  (DQPP).  It  was 
implemented  in  SDD-1.  The  assumption  of  the  system 
environment  is  the  following: 

1.  Each  system  is  a  relational,  nonredundant  DBMS 

(i.e.  unique  copy  of  data). 

2.  The  final  result  of  a  query  is  produced  at  a  single 

site. 

3.  Each  system  can  MOVE  fragments  of  relations  to 

another  system. 

4.  The  communication  cost  is  a  function  of  transmitted 

data  volume  and  the  goal  is  to  minimize  the 

communication  cost. 

This  algorithm  translates  a  query  Q  into  a  sequence  of 
relational  algebra  operations  (selection,  projection  and 
join)  and  MOVE  operations  (move  portions  of  relations  from 
one  site  to  another).  It  first  selects  a  final  processing 
site,  S0  ,  and  constructs  an  initial  feasible  solution: 
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Move  all  R  referenced  by  Q  to  S0. 

Process  Q  at  S#  as  a  local  query. 

The  solution  is  improved  by  recursively  replacing  a 
MOVE  by  lower  cost  sequences  of  MOVES  and  relational 
operations.  It  terminates  when  no  MOVE  can  be  replaced  by  a 
lower  cost  sequence.  This  algorithm  produces  increasingly 
efficient  sequences  of  commands  by  its  hill-climbing 
discipline.  Since  at  each  step  of  refinement  the  best 
alternative  is  chosen,  this  algorithm  can  be  thought  as  a 
greedy  heuristic  algorithm.  The  critical  point  of  this 
algorithm  is  that  the  heuristic  is  too  weak  to  guarantee 
optimality  and  it  has  no  analytical  tool  for  the  evaluation 
of  traffic  volumes  when  MOVE  is  executed. 

Wong's  algorithm  has  been  also  adopted  for  the 
distributed  version  of  INGRES  [BSW  78].  The  algorithm  begins 
by  executing  all  one  variable  subqueries  to  obtain  reduced 
relations.  Each  site  sends  a  short  description  of  these 
relations  to  the  MASTER  INGRES  site  where  the  query  has  been 
originated  so  that  the  MASTER  site  knows  which  sites  are 
involved  in  the  query  processing.  This  algorithm  then  breaks 
the  qualification  into  separate  pieces  using  a  few  simple 
heuristics.  Consequently,  the  sequence  of  distributed 
operations  is  decided  by  means  of  the  quantitative 
information  obtained.  Two  cost  criteria,  minimum  response 
time  and  minimum  communication  traffic  are  considered. 
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2.3.2.  [HY  79] 

Hevner  and  Yao  present  another  approach  to  distributed 
query  processing  that  can  be  interpreted  as  a  semijoin 
approach.  They  consider  a  class  of  simple  queries  in  which, 
after  initial  local  processing,  each  relation  in  the  query 

contains  only  one  attribute  -  the  common  joining 

attribute.  All  relations  are  joined  on  this  single 
attribute.  If  we  assume  the  only  attribute  name  is  A,  the 
qualification  of  a  simple  query  can  be  written  as 


n-l 

q«.  -  A  (Rf* 
5  lai 


R* 


•  A  ) 


By  the  transitive  rule,  the  closure  is 


h-l 

A  A  (R..A 

i»l  )tl  * 


R;.A  ) 


From  the  qual  graph  G^ 
spanning  tree  of 


,  It  is  easy  to  see  that  any 
corresponds  to  a  query  equivalent  to 
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the  simple  query. 

Mote  that  for  a  simple  query,  intersection,  join  and 
semijoin  are  identical  operations.  The  optimization  problem 
for  this  class  of  queries  is  trivial.  They  propose  two 
algorithms  whose  cost  functions  are  the  global  response  time 
and  response  time  related  only  to  the  network  traffic  (total 
transmission  time).  The  first  one  implies  the  maximum 
degree  of  parallelism  of  query  execution.  The  quadratic  time 
bound  is  proved  for  this  case.  The  second  strategy 
corresponds  to  the  minimization  of  network  traffic  and  a 
linear  time  algorithm  is  presented. 

The  authors  attempt  to  generalize  this  algorithm  for 

arbitrary  equi-join  queries.  For  a  relation  R^*(St,r^), 

where  -{A^  -A*» . W  ■  they  define  the  selectivity 

for  each  domain  DomfR^.A^-)  to  be  the  number  of  values  of 

Dom(R.,A<i.  )  currently  appearing  in  the  column  R.  of 

J  * 

relation  state  r  divided  by  the  cardinality  of  Dom(R< .A r:  ) . 
They  assumed  the  selectivity  on  one  domain  does  not  affect 
the  selectivity  of  the  other  joining  domain.  Therefore, 
each  joining  domain  in  the  relation  R^is  handled  separately. 
A  heuristic  algorithm  that  uses  an  improved  exhaustive 
search  is  proposed  for  the  general  queries. 

2.3.3  [GBWRR  81],  [BG  81]  and  [BG  80] 

Zn  [GBWRR  81],  an  algorithm  is  proposed  in  which 
semi- join  concepts  are  exploited  in  order  to  refine  the 
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approach  of  Wong's  algorithm.  In  this  algorithm,  a  query  Q 
is  processed  in  two  phases.  Phase  1  is  called  the  reduction 
phase.  It  seeks  to  reduce  the  database  state  with  respect  to 
Q.  The  use  of  semijoins  is  the  principal  tactic  in  this 
phase.  A  sequence  of  semi  join  operations  SJ  is  said  to 
reduce  a  database  state  D  for  query  Q  if  we  may  apply  SJ  to 
D  without  affecting  Q's  answer,  i.e.  Q(D)»Q(SJ(D) ) .  A 
sequence  of  semijoins  is  called  a  reducer  for  Q  if  it 
reduces  every  database  state  for  Q.  A  sequence  of  semijoins 
SJ  is  cost  effective  in  state  D  if  the  amount  of  data 
requiring  inter~site  data  transmission  in  order  to  compute 
SJ (D)  is  less  than  or  equal  to  the  quantity  of  data  in  D 
that  will  be  eliminated  by  SJ.  The  goal  of  the  reduction 
phase  is  to  translate  Q  into  a  cost  effective  reducer.  This 
reduction  phase  is  known  as  the  full  reducer  problem. 

Phase  2  is  the  final  processing  phase.  The  system 
selects  one  site  as  the  final  processing  site  and  the 
reduced  databases  of  the  other  sites  are  transmitted  to  the 
final  site.  The  system  then  excutes  Q  against  these 
databases  at  that  site  as  a  ?cal  query.  The  final 
processing  phase  of  SDD-1  is  very  simple.  The  core  of  the 
SDD~1  query  processing  algorithm  is  the  reduction  phase. 

The  authors  present  a  heuristic  algorithm  that  solves 
this  problem  for  a  class  of  equi-join  queries.  By  defining 
rules  for  estimating  the  cost  and  effectiveness  of 
semijoins,  the  algorithm  starts  with  an  initial  feasible 


solution  consisting  of  null  reducer,  i.e.  an  empty  sequnce 
of  semijoins.  The  algorithm  improves  the  initial  solution  by 
iteratively  appending  cost  effective  semi-joins  to  it.  When 
all  cost  effective  semi- joins  have  been  exhausted,  the  basic 
optimization  is  complete.  At  this  point  the  algorithm  is  the 
same  as  Wong’s  algorithm.  Let  the  sequence  of  semi joins  just 
constucted  be  SJ.  The  algorithm  next  permutes  the  order  of 
semi joins  in  SJ  such  that  the  effectiveness  of  SJ  is 
increased  while  its  cost  is  decreased.  Finally,  the 
algorithm  selects  a  final  processing  site  and  prunes 
semijoins  in  SJ  that  are  made  unnecessary  by  the  choice  of 
final  site.  The  resulting  sequence  of  semi joins  is  executed 
and  quaranteed  to  be  a  cost  effective  reducer  for  Q.  It  is 
still  not  guaranteed  to  be  optimal. 

2.3.4  [CHIU  79]  and  [CH  80] 

The  use  of  semijoins  for  distributed  query  processing 
is  also  studied  by  Chiu.  He  considers  a  sub-class  of 
equi-join  queries  whose  qualification  can  be  written  as 


n-1 

qc  «  A  (  VA*  ,A*  }  Aa  ♦  Aj 

The  qual  graph  of  this  queiry  when  n»5  is: 
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In  this  class  of  query,  every  joining  attribute  of  one 
relation  joins  to  exactly  one  attribute  of  another  relation. 
This  sub-class  is  called  a  chain  query  because  the  graph 
representation  of  it  is  a  chain.  This  class  of  queries  has 
the  property  that  the  closure  of  the  qual  graph  6  is  the 
same  as  G 

They  proved  that  semijoins  are  powerful  enough  to  solve 
chain  queries.  They  are  the  only  authors  who  present  a 
syntatic  characterization  of  chain  queries  and  derive  an 
efficient  dynamic  programming  algorithm  that  translates  any 
chain  query  Q  into  an  optimal  sequence  of  semijoins  that 
compute  Q.  This  algorithm  has  0(n3)  time  complexity,  where 
n  is  the  number  of  relations  referenced  by  Q.  They 
generalize  this  approach  to  a  larger  class  of  queries  called 
tree  queries  whose  answer  is  a  subset  of  one  relation  in  the 
database,  and  develop  a  methodology  for  optimally  solving 
this  class  of  tree  queries.  However,  the  syntatic 
charaterization  becomes  more  complicated.  No  timing  analysis 
is  presented  for  the  tree  query  case. 

2.3.5  Summary 

Most  of  the  distributed  query  processing  algorithms 

developed  to  date  have  th4  following  common  features: 

/ 

1.  single  query  processing 

i 

Most  of  the  algorithms  only  consider  the  optimization 
of  the  processing  of  one  query,  as  if  DBMSs  were  a  single 
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user  system.  In  fact,  every  node  in  DBMSs  is  simultaneously 
an  access  point  of  DBMSs.  Moreover,  every  node  has  the 
ability  to  do  query  decomposition  and  to  process  relational 
operators.  It  is  desired  to  have  an  optimal  processing 
algorithm  for  the  set  of  queries  which  are  in  the  system  at 
a  given  time  interval. 

2.  communication  cost  dominance 

The  inter-system  communication  time  is  the  dominant 
cost  of  distributed  query  processing.  The  communication  time 
is  dependent  on  the  volume  of  data  to  be  transmitted.  The 
total  transmission  time  of  a  query  is  proportional  to  the 
amount  of  data  and  messages  required  to  be  transmitted.  Most 
of  the  algorithms  consider  only  the  transmission  cost.  It 
would  be  desirable  to  develop  algorithms  that  consider  local 
processing  costs  as  well.  Another  direction  would  be  to 
select  a  better  execution  sequence  for  semi join  and  join 
operations  and  exploit  the  feature  of  parallelism  of  data 
transmissions  over  links  and  local  processings. 

3.  heuristic  algorithm 

For  each  query  processing  strategy,  the  costs  of  one 
step  in  the  execution  depend  on  previous  steps.  The  set  of 
strategy  space  blows  up  very  quickly  as  the  number  of  steps 
increase.  This  suggests  that  the  distributed  query 
processing  problem  may  inherently  be  a  complex  problem.  Most 
of  the  distributed  query  processing  algorithms  are 
heuristics.  [CHIU  79]  is  the  only  one  who  studies  the 


syntatic  characteristics  of  semijoin  programs  for  a  class  of 
chain  queries  in  order  to  reduce  the  set  of  strategy  space. 
It  is  our  desire  to  study  the  complexity  of  the  distributed 
query  processing  problem.  We  are  also  interested  in  the 
understanding  of  quantitative  characteristics  of  this 
problem. 

2.4  A  Model  for  Equi-join  Query  Processing 

As  we  reviewed  in  the  last  section,  most  of  the 
distributed  query  processing  algorithms  proposed  have  the 
common  philosophy  of  performing  local  processing  first,  then 
applying  as  many  semi joins  as  possible  to  reduce  the 
database  state  as  much  as  possible  and  then  sending  to  the 
final  site  to  perform  the  join  and  produce  the  final 
results.  The  reason  for  doing  so  is  presumably  that  the 
semijoin  tactic  will  be  profitable. 

As  described  in  [DLL  80],  there  are  some  rules  which 
may  be  used  to  help  "optimize”  relational  expressions, 
although  these  rules  in  no  sense  guarantee  optimal  overall 
equivalent  expressions.  The  basic  idea  is  to  attempt  to 
perform  selections  and  projections  as  early  as  possible. 

For  a  query  Q«(q,TL) ,  let  {  R,  ,  R2,  ...,  R^  }  be  the 
set  of  relation  schemas  referenced  by  q  and  let  X  be  the  set 
of  attributes  appearing  in  q.  Before  processing  the  query, 
we  can  project  each  relation  R^  over  attributes  (X(JTL)nR;. 
We  then  execute  those  subqueries  which  reference  only  one 
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local  relation.  We  may  also  want  a  cascade  of  those 
operations  to  be  organized  into  one  selection  followed  by 
one  projection  and  group  selections  and  projections  with  the 
preceding  binary  operation.  From  here  on,  we  represent  R,  ,RA 
,,..,Raas  the  relations  after  such  preprocessing. 

In  a  distributed  query  processing  environment,  if  we 
adopt  the  assumption  that  the  data  communication  cost 
dominates  the  local  processing  cost,  then  the  local 
processing  costs  of  a  query  (e.g.  select,  project)  are 
negligible.  The  only  significant  cost  needed  to  be 
considered  is  the  data  transmission  cost.  Data  transmission 
is  incurred  when  two  relations  that  must  be  joined  may 
reside  at  different  sites.  To  perform  the  join,  one  way  is 
to  move  the  entire  relation  from  one  site  to  the  other.  The 
other  way  is  to  replace  a  join  by  performing  semijoins  first 
and  then  performing  join.  Assume  R(  and  R2  at  different 
sites  and  we  want  to  join  R(  and  R^  at  the  site  of  RA  .  By 
the  semijoin  strategy,  one  can  send  the  projection  of  Rx  on 
its  joining  columns  to  R,'s  site  and  perform  a  semi join  to 
reduce  R,  by  R2  before  sending  R,  to  Rx' s  site.  This  will  be 
a  profitable  tactic  only  when  the  projection  of  R^  on  its 
joining  columns  is  smaller  than  the  amount  by  which  R(  is 
reduced  by  the  semi join.  From  the  above  example  of  joining 
two  relations,  one  can  easily  be  convinced  that 
semi joins-then-joins  stategies  may  not  be  able  to  produce  an 
optimal  strategy  for  the  objectives  of  the  minimization  of 
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the  total  data  transmission  cost.  Our  approach  is  to  extend 
the  strategy  space  to  the  class  of  joins-semi joins-mixed 
strategies.  A  joins-semi joins-mixed  strategy  is  an  ordered 
sequence  of  join  and  semijoin  operations  where  join  and 
semi join  operations  intermingle  with  each  other,  (i.e.  The 
order  of  join  and  semijoin  operations  do  not  have  any 
restriction.) 

We  assume  that  a  query  Q,  specified  by  a  qualification 
q  over  the  relations  R, ,  R2,...,Rn,  and  by  a  target  list  TL, 
can  be  decomposed  into  a  set  of  operations  {  p  ,p , ...p  } 
which  will  produce  the  answer  to  the  query,  where  pK$j$,  the 
set  of  relational  algebra  operators.  In  general,  a  query  can 
be  decomposed  into  several  different  executing  sequences 
which  will  produce  the  same  answer.  We  call  such  an 
executing  sequence  a  strategy.  Let  S(Q)  denote  the  set  of 
strategies  which  answer  the  query  Q.  The  goal  of  the  problem 
is  to  minimize  the  overall  cost  of  executing  this  query  Q. 
We  can  formulate  this  problem  as 

X 

MIN  f(P,D[0])-  21  f.(p. ,D[i]) 

P6SW  a«I  *  A 

S.t.  P-P,P2 . Pjl 

D[i+13-P^(D[i3) 

D[0]  is  the  initial  database  state 
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Here  p^(D[i]>  means  the  mapping  from  the  temporary 
database  state  at  stage  i  by  the  operation  p^  to  a  new 
temporary  database  state.  During  the  analysis  of  a  query 
solution  strategy,  the  database  state  is  conceptually 
considered  changing  from  one  state  to  the  other  state  by  an 
operation.  The  real  database  state  stored  in  the  system  does 
not  change  unless  update  operations  are  performed. 

As  shown  in  [BG  81],  any  query  Q-(q,TL)  with  an 
equijoin  qualification  q  and  a  target  list  TL  can  be 
efficiently  transformed  by  renaming  attributes  of  the 
relation  schema  and  qualification  into  an  equivalent  natural 
join  query.  Instead  of  the  class  of  equijoin  queries,  EQJ, 
we  shall  study  the  class  of  natural  join  queries,  NJQ. 

In  this  section,  we  restrict  our  study  to  a  class  of  < 

queries  that  after  initial  local  processing  and  attribute 
renaming,  the  resulting  queries  are  natural  join  queries. 

Although  it  is  a  subset  of  the  complete  relational  calculus 
language,  it  is  a  rich  and  large  class  of  queries  in 
practice. 
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2.4.1  Definitions  and  Assumptions 

We  assume  a  distributed  database  management  system 
DDBMS  consists  of  a  collection  of  interconnected  computers  S, 

,  ,  ...,Snat  different  sites.  Bach  computer,  known  as  a 

node  in  the  network,  contains  a  DBMS.  Data  are  logically 
viewed  in  the  relational  model.  Without  loss  of  generality, 
we  assume  each  site  only  consists  of  one  relation.  In  the 
distributed  database  DD*{  D( ,  D^, . . . ,  D  n  } ,  where  each  D^ 
only  consists  of  one  relation  ,  we  shall  use  DD«{  D,  ,  Da 
,...,  Da  )  or  {  R| ,  Ra  ,...,Ra  }  interchangeably  when  no 
confusion  will  occur. 

Data  transmission  in  the  network  is  via  communication 
links.  We  assume  that  the  transmission  cost  to  send  one  byte 
of  data  between  any  two  sites  i  S>  j  is  known  and  equal  to  c>j 
.  Thus  the  cost  function  of  transmitting  data  of  volume  V 
between  two  sites  i  &  j  is  a  linear  function  C  (V)»c *j  *v. 
We  assume  that  all  possible  subqueries  involving  data  at  a 
single  site  are  preprocessed;  This  we  call  "local 

processing”.  The  effect  of  local  processing  is  to  reduce  the 
amount  of  data  that  needs  further  processing.  We  will 
regard  the  state  of  each  database  as  the  resulting  state  of 
the  database  after  local  processing.  Thus,  after  local 
processing,  the  following  parameters  of  the  qeury  can  be 
defined. 
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n  ■  number  of  sites  (i.e.  relations)  in  the  remaining 
query 

“  I  rl  I  *  number  of  attributes  in  site  R-t 

Y*:  »  R-  A  Rj  ,  the  set  of  attributes  of  joining  domains 

between  R.  &  R: 

<  i 

■  number  of  tuples  in  relation  R^ 
w ( A)  ■  the  width  of  data  item  of  attribute  A 
s?  ■  vi  *  11  w ( A)  ,  the  size  of  the  relation  R. 

A  w 

w(yLp-  ^  4  w(a) 

In  DDBMS ,  we  define  two  types  of  directed  operators. 


Definition: 


1.  < | X | . ^  (or  R.<|X|  Rj )  is  the  directed  natural  join 
operator  which  sends  R^  to  R^  and  performs  the 
natural  join  of  R*  and  R*  at  R:'s  site. 

<  f  * 


2.  <|X*^  (or 


RC  <|X 


"i  > 


is  the  directed  natural 

semijoin  operator  which  projects  'R-AR*  over  R^ 

,  sends  the  result  to  R^  and  performs  the  join  of  R^ 

and  that  result  at  R*’s  site.  (i.e.  R.  |  X  |  if  R  at  R. 

x  Ti^  x 

’s  site). 


Note  that  |X|>j*  «R^  |X|>R-  and  X|>.- »R.  X|>R.  are  similarly 
defined.  One  can  use  them  interchangeably.  The  semi  join 
operation  only  reduces  the  relation  state  without  changing 
the  relation  schema. 
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Definition: 

A  join-semi  join  program  P»p  p  .  ..p,  is  a  sequence  of 
directed  natural  join  and  directed  natural  semijoin 
operators. 

A  natural  join  qualification  q  vith  final  node  at  R, 
can  be  done  by  sending  all  relations  R^ ,  itl,  to  R,  and 
performing  R,  |X|R4 |X| . . .  |X|Ra  at  node  R,  .  So  Ra  |X|>R,  ,  Rj 
| X } >R ,  ,...  R^  |X|>R,  or  its  permutation  are  join-semi join 
programs  of  this  qualification  q. 

2.4.2  Query  Processing  Graph 

We  define  a  processing  graph  of  a  qualification  over  a 
database  schema  DD»{R^  j!^to  be  a  graph  with  two  types  of 
edges, <\^  ,AlfB4>.  V4  is  the  set  of  nodes,  which  is  equal  to 

D.  Ag  is  a  set  of  semi  join  edges  which  is  {a  ••  |  R.A  R.  and 

f  » 

We  denote  such  an  edge  by  i  — > —  j  with  one  arrow 
on  the  edge.  |  i4j  }  is  the  set  of  join  edges. 

We  denote  such  an  edge  by  i  — » —  j  with  two  arrows  on  the 
edge. 


Note  that  if  R*trt  Rj  «/T,  then  we  can  not  perform  a 

semi  join  between  R^and  Rj  ,  so  a.  j  is  not  a  semi  join  edge.  If 

R^  £  Rj  t  then  R^-R^flRj.  The  semi  join  of  R^  to  Rj,  R.X|>Rj, 

is  the  same  as  the  join  of  R«  to  R • ,  R^ |  X  |  R  • .  This  operation 

is  covered  by  join  edge  b;*  . 

4 
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Example: 

® »  “  {  A  | ,  A^  ,  Aj  ,  A^} 

^i“{  ^  '^*A^  'Aj] 

V<  VW 

The  processing  graph  of  the  natural  join  qualification  q  is: 


Without  lost  of  generality,  from  now  on  we  assume  that 
the  final  node  of  a  query  is  node  1. 

Definition: 

Given  a  natural  join  qualification  q  fa 
join-semi join  program  P  is  said  to  be  correct  with 
respect  to  q  if  after  executing  the  program  p,  the 
final  node  will  have  a  new  relation  r;  ■  R, |X|Ra 
|S|...|I|Rn. 

Lemma  1: 

A  join-semi  join  program  consisting  of  a  directed 
path  of  edges  in  B,  from  R  to  R  ,  bk  ,b..  „  . . .  ,b 
will  form  a  relation  R  |X|R.  |Z|...|Z|R  in  node  R„ 

Proof:  We  prove  this  by  induction  on  the  length  of  the 
path.  If  1»1,  then  the  path  is  b.  '  .  After  this 

Kf  n  | 
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operation  we  will  have  IT  < -  Rk  IXIR*  .  By  the 

induction  assumption  on  1-1,  R'  < - R„  IXIR., 

|X|...jX|R  .  In  the  case  of  1,  for  the  first  1-1 

edges  of  this  path,  R'  -R„  |X|R„  |X|...  | X | R by 

*X-\  l>o  N  *  *t-\ 

induction  assumption.  After  performing  bK  K  ,  we 

r  }  X 

will  have  R^  «R^  lxlRKjt  "RkJX,V  lXl  *  *  *  lxlRkt  • 


Definition: 


Given  a  directed  spanning  tree  T  toward  final  node  R 
,  a  program  of  operations  in  ,  btc,Kl  'bK»lV  *  •  *  'b 
is  said  to  associate  with  T  if  each  directed  path  in 
the  directed  spanning  tree  has  the  same  ordering  as 
the  subsequence  of  corresponding  operations  in  the 
program. 


Example:  In  the  following  directed  spanning  tree  T0  toward 
node  1,  by^bi4b^b+l  fc^a  ba|  is  a  program  associated 
with  T e  and  bJa  b^  b^  by^ b^  b^(  is  another  program 
associated  with  T.  . 
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Lemma  2: 

A  program  with  edges  in  associated  with  a 

directed  spanning  tree  with  nodes  k,  ,  kx  , ...,ka 
and  toward  node  1  will  form  the  joining  of  ,RK^ 

,  •  •  •  ,R,#  ,R  ,  (i.e.  I  X|  R  |Z|  ...  |X|R  |X|R  )  at  node 
1. 

Proof:  we  prove  this  by  induction  on  the  height  of  the 
spanning  tree.  If  h-2,  then  the  resulting  relation 
at  node  1  is  the  joining  of  R  with  all  leaf 
relations  which  is  Rf<  |  X  |  R^J  X  | . . .  | X | R^^i ^ t R i  at  node 
1  .(see  Figure)  By  the  induction  assumption  on  h-1, 
the  resulting  relation  at  the  final  node  1  is  the 
joining  of  all  the  relations  in  the  nodes  of  the 
tree.  In  the  case  of  h,  each  node  at  level  1  has 
height  h-1,  and  the  resulting  relation  at  these 
level  1  nodes  is  the  joining  of  all  the  relations  in 
the  nodes  of  the  corresponding  subtree  above  that 
node.  Consider  the  final  node  1;  it  will  join  all 
the  resulting  relations  in  the  level  1  nodes  of  node 
1  with  the  relation  R,  .  Because  a  directed 
spanning  tree  will  contain  each  node  k  exactly 
once,  we  will  obtain  the  relation  R^IXjR^JXj . . .  | X | RK 
I X  |  R  |  at  node  1  .  |w| 

f  3 


0 
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Theorem  1: 

Let  Q«(q,TL)  be  a  natural  join  query  and  TL«R,  U...U 
Ra  .  Let  P  be  a  join-semi join  program  for  q;  then  P 
is  correct  for  q  iff  there  exists  an  ordered  subset 
of  the  set  ]  in  P  which  associates  with  a 
directed  spanning  tree  toward  node  Rt. 


Proof: 


IP:  Since  a  natural  semijoin  from  relation  R  to  S 

only  reduces  relation  state  s  to  a  new  state 
0 

consisting  of  tuples  with  values  in  the  columns  of 
joining  attributes  appearing  in  both  r  and  s,  it 
does  not  change  the  relation  schema  S.  Thus  after 
performing  the  sequence  of  join  opeations  associated 
with  the  directed  spanning  tree  toward  R, ,  we  get  R, 
|Z| . . . |Z|Rn  at  node  R|.  Any  other  join  operation 
does  not  change  the  state.  This  implies  P  is 
correct  for  q. 

ONLY  IP:  Let  P  be  a  correct  program  for  P.  For 
each  semi  join  operation  ag,  in  P,  R^n  R^f/T  and  R^ 
.  Thus,  performing  the  semijoin  operation  does  not 
move  the  full  relation  state  from  node  R^  to  node  R- 

Q 

.  We  still  need  to  perform  a  join  to  move  the  full 


table  of  R •  to  R;  .  If  there  does  not  exist  a  subset 

r 

of  {bjH  in  P  which  form  a  directed  spanning  tree 
0 

toward  node  R, ,  then  there  is  some  node  RK  which  is 


disconnected  from  the  tree  component.  Then  some 


information  from  those  nodes  which  do  not  have  a 
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path  toward  Rf  is  lost.  So,  in  order  to  form  R, 
|X|RZ  |Z|...  |X|Rn.,|X|Rnat  node  R,  ,  there  must  be  a 
subset  of  edges  in  Ba  that  forms  a  directed 
spanning  tree  toward  node  R, . 

From  theorem  1,  we  know  that  given  a  NJQ  qualification 
q,  the  set  of  correct  programs  for  q  is  the  set  of 
join-semi join  programs  such  that  there  exists  a  directed 
spanning  tree  toward  R  out  of  the  set  of  join  edges  in  P. 
We  denote  this  set  of  correct  programs  for  q  by  ^P(q).  In 
this  thesis,  we  restrict  the  problem  by  only  looking  for  a 
best  program  within  this  class  of  programs.  The  distributed 
query  processing  problem  becomes  to  find  a  program  P  €  ^P(q) 
with  minimum  communication  cost.  For  a  program  p,  if  we 
change  the  order  of  the  sequence  of  operations,  the  total 
communication  cost  will  be  different.  The  set  of  correct 
programs (P(q)  is  very  large.  In  fact,  after  executing  one 
operation  in  P,  the  number  of  rows  and  columns  of  some 
relations  will  be  changed.  This  change  then  affects  the 
communication  cost  of  the  next  operation.  So  the 
communication  cost  of  one  operation  will  depend  on  the 
previous  subsequence  of  operations. 

2.4.3  Estimate  the  Size  of  the  Derived  Relations 

In  order  to  compare  the  communication  cost  of  query 
processing  strategies,  it  is  very  important  to  have  a  method 
of  estimating  the  size  of  a  relation  after  one  operation. 
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Also  the  system  for  estimation  of  the  size  of  the  derived 
relation  must  be  consistent  in  the  sense  that  if  two 
sequences  of  operations  will  produce  the  same  results,  the 
estimated  sizes  of  the  result  according  to  the  two  sequences 
of  operations  must  be  the  same.  In  this  section,  we  use 
capital  R  to  represent  both  relation  state  and  relation 
schema . 


We  introduce  the  notion  of  semijoin  reducibility  and 
join  reducibility  of  to  R^  ,  denoted  by  o^y  and  ^ 

respectively,  for  each  pair  of  relations  Rt  and  R-  ,  where 

w 

03  $1  and  03  £l.  The  interpretation  of  the  semi  join 

reducibility  of  R-t  to  R;  is  the  percentage  of  rows  of  Rj 
that  are  eliminated  after  performing  the  semi join  R*  X|>  R.  . 

J 

At  stage  t,  if  the  number  of  rows  of  Rj  is  Vj [t-1]  and  the 

semi join  reducibility  of  R*fc  to  Rj  is  [t-1],  then  the 

number  of  rows  of  R  after  performing  semi join  R.  X|>  R.will 

^  J 

be  reduced  to  Vj  [t]»  Vj  [t-1]*  (1- o^,y[t-l] ) .  Note  that  the 
semijoin  reducibility  of  R.  to  Rj  is  not  necessarily  equal 
to  the  semijoin  reducibility  of  Rj  to  R*  and  o^;c[t]*0  for 
all  t.  The  interpretation  of  the  join  reducibility  of  R  •  to 


Rj  is  that  after  performing  join  R.  |X|>  R • ,  the  number  of 
rows  of  the  new  relation  R*jX|Rj  at  site  j  will  be  v. [t]*  v. 
[t-1]*  v.  [t-1]  *  U-  «4j[t-l])  *  (l-ojftlt-lj)  *  (l-fy[t-l]>. 
This  is  because  the  effect  of  join  R>  |X|>  Rj  is  equivalent 
to  performing  the  semi  joins  R;X|>  Rj  and  R.X(>  Rjand  then 
performing  the  join  of  R  •  to  Rj  .  Both  semi  join 
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reduciblities  and  join  reducibility  affect  the  number  of 

rows  of  the  new  relation.  The  join  reducibility  of  R  .  to  R. 

0  J 

is  the  same  as  the  join  reducibility  of  R  j  to  R  ^  .  i.e. 
^..[t]  ■  ^[t].  Also  jj..[t]«0  for  all  t. 

For  this  paper,  we  assume  that  the  number  of  tuples  of 
relation  R*  in  the  system,  v^  ,  and  the  set  of 
reducibilities  {  o^j  ,  }  of  each  pair  of  relations  are 

known.  Note  that  these  quantities  depend  on  the  initial 
local  processing  and  attribute  renaming  process  of  any  given 
query.  After  the  systems  are  running,  they  can  be  updated 
periodically  according  to  statistical  measurements.  They 
will  form  the  basic  information  for  processing  a  given 
query. 


Since  the  number  of  rows  and  columns  of  a  relation  will 

be  changed  after  one  operation,  the  reducibilities  of  this 

relation  with  other  relations  will  be  changed  too.  We 

define  how  the  reducibilities  will  be  changed  after  one 

operation.  Assume  the  databases  before  the  operation  p^  to 

be  D«{Rf  [t-1] , . .  .R^  [t-1] } ,  the  number  of  rows  of  each 

relation  R.  [t-1]  to  be  v-[t-l],  and  the  semi  join  and  join 
k  * 

reducibilities  of  R. [t-1]  to  Rj[t-1]  to  be  o^jft-l]  and 

*  «  / 

If  the  operation  p^  at  stage  t  is  a.j  ,  i.e.  R.  X | >Rj  , 
then  the  database  schema  will  remain  the  same.  The  state  at 
node  j  will  change  to  Rj  [t]»R*  [t-1]  X|  Rj  [t-1]  and  all 
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other  states  will  remain  the  same,  i.e.  R  [t ]«R ^  [t-1 ] ,  V  k 
$  j.  The  number  of  rows  of  relation  Rj[t]  will  be  changed 
to  equal  v-  [t-1]  *  (1-  o^"{t-l] )  and  the  number  of  rows  of 

J  J 

all  other  relations  will  remain  the  same.  At  stage  t,  we 
have 


v- 

A 


V  . 

J 


[t]«V;[t-l]* 

(l-*Cj[t-l]) 


According  to  the  definition  of  reducibilities,  the  estimated 

size  of  R.  XI (R.  X I R ;  )  is 
At  ^  J 

vj[t]*(l-^..[t])  -  v.[t-l]Ml-o^[t-l]>  *  U“^[t]). 

Since  R.  X|  (R  •  X|R  ,  )  -R  .  Z| R ;  , 

-v.  [t-1]  *  (1-otjtt-l))  , 
which  implies  cfcj[t]»0. 

The  estimated  size  of  (R*  XjRj)  X|R^  is 
v^tt]  *  (1-  o^ittj)  -  v^[t-l]  *  <l-o$t[t]). 

Since  (R-  X (Rj )  Xj  R-  *  (Rj  X|R-  ), 
v.[t-l]  *(l-o<'.i[t])  -  vj[t-l]  *  <W;;[t-l]), 
which  implies  o^\[t]a  o^t[t-l]. 


Next,  we  consider  the  reducibilities  «^j[t]  and 
At  stage  t,  we  have 


v^tt-i] 


k 

I  **  I 


<*)*[*] 


[t].v;[t-l]* 

(1-  ofyt-1]) 
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Lemma:  R  ^  X  j  (R^  X  J  R  j  )  ■  R^  X|  (R ^  XjRj  ) 

Proof : 

RKX|(R.X|Rj) 

•{t€  Rj  J  t  £  Rl  X|Rj  igs^R^  ,  s.  t. 
t  [YjK]*SK[yjit]} 

»{t  4R.  |  3  S-iRj  &  3  S„€  RK  ,  s.  t. 

t  [Yjj  ]-S.[Y.j]  fc  t  [Y*.  J-SK[YKj]}  (*) 
Similarily,  we  have  R^  X{  (RK  X|Rj  )■  (*). 

Thus,  RK  X|  (Rt  X|Rj  )  -  R  .  X|  (RK  X|Rj) . 

The  size  of  R K  X| (R^  X|Rj )  is 
and  the  size  of  R^  X|(RtcX|Rj-)  is 

By  the  above  lemma,  they  should  be  equal. 

In  extreme  case,  o^[t]  could  be  either  0  or  1.  In 
general,  we  will  find  an  approximation  function  of^Kj[t] 
Let 


J<v 

■the 

set 

of 

values 

in 

columns 

V 

of 

R; 

WV 

■the 

set 

of 

values 

in 

columns 

of 

rk 

A 

)»the 

set 

of 

values 

in 

columns 

V 

of 

RC 

Here  we  assume  the  set  of  elements  in  J(Yj^  )  is  uniformly 
reduced  by  the  operation  a ,  i.e.  the  percentage  of 
elements  being  reduced  in  the  set  of  common  elements,  J(Y.K  )A 
K(Y-k  )  and  in  the  remaining  set  of  non-common  elements,  J(YjK 
)”R(Yj|fc)  in  J ( Yj^)  by  operation  a  ••  are  the  same.  Thus,  the 
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ratio  of  the  size  of  the  set,  J;(YjK)  MYjj*),  of  common 
elements  in  with  the  size  of  the  set,  ( Y j^c)  r  in 

(Yj|t  )  is  the  same  as  the  ratio  of  the  size  of  the  set,  J  (Y^ 
)AK(Yj|t),  of  common  elements  in  J  (Y jK)  with  the  size  of 
the  set,  J  (Yj,t  ) ,  in  J  (Y )  before  operation  ay  ,  i.e. 

— .  .  1J  1y;k  >  nR(5a  )i 

So  we  have 

O^nj  [  t  ]  *  [  t  ~  l  ] . 

This  implies  the  size  of  R  X|(R.  X|R; )  is 

ifc  A  J 

v.  [t-l]*(l“o^j[t“l])*tl-  WKj[t-l]). 


Following  the  same  assumption  as  above,  after  operation 
a.j  ,  the  number  of  common  elements  in  columns  Y^  of  R  •  X|R^ 
and  R ^  has  been  reduced  by  (l-o<cj  [t-1]).  After  performing 
semijoin  operation  (R  ^  XjRj  )X|>RK  ,  the  size  of  (R  i  X|Rj 
)  X  |  R^  w  i  1 1  be 

vK[t-l]*(l-^[t-l])*(l-W/>[t-l]). 

By  the  definition  of  semijoin  reducibility ,  the  size  of  (R. 
X|R. )X|Rkwill  be 

J  * 

VjJt-lJMl-ofjjJt]) 

where  ^[t]  is  the  semi join  reducibility  of  R;X|Rj  to  R^  . 
This  implies 

-  (1-  *<ij  [t-l])*(l-o^J6[t-l])  . 

Thus,  0^JK c t ] -  ^t-ll  +  ^tt-l]-  ^Ct-l]*<;.[t-l]. 
for  k  ♦  i, j. 
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For  the  join  reducibility  ^[t]  of  R.  X|Rj  with  R ^  , 

if  we  want  to  still  have  ^jK[t]  defined  by  pairwise  formula 
v.  [t]*  v|c[t]*(l-of;li[t])*(l-<;[t])*(l-f.J{[t3), 
then  we  must  have 


We  will  discuss  the  reasons  later.  Here  we  assume  ^[t]  * 
max{  1-(1-  jJj|l[t-l])/(l-  e<ij  [t-l])r  0}. 


Next,  we  consider 

stage  t,  we  have  R*  X|R- 

J 


reducibility  ^j[t]. 

at  node  i  and  R  .  X | R; 

^  '  J 


Suppose  at 
at  node  j. 


j 


vc[t] 


Rt  X|Rj 


VJ  [t]»v-[t-l]* 
1 


then  oiij  tt]*0,  v. [t)-v{ [t-1]*  (1-  U)i  [t-1] ) 

and  Vj [t]-vj [t-1]*  (1-  «6j[t-l] ) . 

Since  (R.  X|R*)  |X|  (R*  X|R;)  »  R.  |X|R*  ,  it  follows  that 
v£  [t-1]*  (1-  C<j;[t-1])  *  V.[t-1]*  (1-o/jjtt-l])  *  (l-pCj[t-l]) 

-  V.  [t-1]*  d-0<;i[t-l])  *  V^[t-1]*  (l-.fy[t-l])  *  (l-^[t]). 
Thus  pij[t]«  pCj[t-13. 

Figure  2.1  illustrate  these  changing  rules.  We  summarize 
the  reducibilities  changing. rules  after  semijoin  operation  a 
in  the  following: 


Ucj  [t]».  o 

o^[t]-  o/jKCt-1]+^j[t“l]-o<;it[t-l]*#<^[t-l]  V  kf  i,j. 
<^[*3-  ^flCt-l]  otherwise. 
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fJ|c[t]-  max{  1-(1-  ftKCt-l])/(l-  o&j  Ct-1 3 ) ,  0}.  V  k*  i,j. 
ht  t-1]  otherwise. 


If  the  operation  at  stage  t  is  bf.  ,  i.e.  R^  |X|>Rj  , 

then  the  database  schema  at  node  j,  Ry[t]  will  change  to  R^ 

[t-1]  U  R. [t-1],  and  the  relation  state  at  site  j  will  be  R. 
J 

[t-1]  I X I  R.rt-1).  The  number  of  rows  of  R.  [t],  v.[t],  is 
J  J  * 

V.  [t-1]  *  v.  [t-1]  *  (1-  o<Cj  [t-1])  *(1-  *{jc  [t-1])  *  (1“ 

J  * 

All  other  relation  states  will  remain  the  same.  Because  this 
is  a  join  operation,  the  semijoin  reducibilities  and  join 
reducibilities  will  be  affected.  (See  Figure  2.2) 

At  stage  t,  we  have 


v-[t]« 

Vjft-1] 


*:(t]«vt[t-l]*v;  [t-1] 
J  *  (l“ofri[t-l]) 

*  (l-«Af[t-l]) 

*  d-^/tt-1]) 


According  to  the  definition  of  reducibilities,  the  estimated 
size  of  R-t  X|  (R.  JX|Rj  )  is 

vj[t]  *  (l-<j[t]). 

Since  RL  X|(R.  |X|Rj  )  -  Rt  |X|Rj  ,  this  implies 
1"  eiij  [t]-l,  i.e.  o4j[t]-0. 

Similarily,  the  estimated  size  of  (R  •  | X | Rj  )  XjR-  is 
v.[t]  *  (1-o^itt])  -  v.  [t-1]  *  (1-^-iCt)). 


59 


Since  (RjJX|Rj  )  X|R;  -  R  j  X|R.  , 

v.[t-l]  *  (1-o/jitt])  -  Vjtt-1]  *  (l-«£f[t-l]) 
which  implies  o^f[t]«  otji  [t-1]. 

Lemma:  (R  •  |X|Rj  )  X|  R^  CR.X|  (K.  X|  R^  ) 

with  equality  when  Y{*  £  Y^  fl  Yjk  . 

Proof:  Let  Y{j>  «Y Cj  n  Y Jlcn  Y;|t  .  Then 
(R  .  I X | R j  )  X |  R^ 

■{t^Rjj  |  3  Se  €  R^  |X|Rj  5s,[Y.KaYjK]  -  tfY^  Yj(,  ] } 

* { t*R|c  |  3  s,  €  R.  &  s  €  R.  s.  t. 

J 

1  ■  *,1*1*  ]  i 
U*  >  1  -  »*[*;*  ]  4 
M*tj  ]  ■  Sst*;j  ]  }• 

Also 

R-k  *1  (Rj  *1  R*  ) 

-{t«RK  |  3  SjfiRj  >  t[TJlt  ]  -  s,[IjK  ]  t 
s4«  R-  3  t[Y(*  ]  -  S4ITilt  ]  ). 

Thus  (R.  |X|Rj  )  X|  R^  6  R;  X|  (R  X|  R  ), 

If  YiJ- Wen  8 1  [ Y  ij  3  «  t[YCj  ]  -  sa[Yi;-  ]  . 

This  implies  equality  of  relation  states  when  Yc:  £  Y. 

J  ^ 

"Yj*  • 

Because  of  the  above  lemma,  we  approximate  the  number 
of  rows  of  (R  ^  |X|Rj  )  X|  R^  by  the  number  of  rows  of  R. 

X|  (R-  X|  RK  ).  If  at  stage  t,  we  have 
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j 


vL[t]«  I  Rl  | 

L[t-1]  . 


the  joining  domain  of  R  .  jX|  Rj  with  R^  will  be  Y.^U  Y^  . 
Since  vK[t]  * 

«  vjt-l]  *  (l-^K[t-l])  *  (l-o$K[t-l]) 
implies  (1-^lt])  -  (1-  Jctlt-1])  *  (W;K[t-l]), 

the  semijoin  reducibility  of  R  •  |X|  R ;•  with  R^  will  be 

efjjctt]-  +  It_1l  -  <^iK  Ct-1]  *  ^[t-l]. 

Lemma:  R^  X|  (R  .  |X|  Rj  )  C  (R^  X|  R^ )  |X|  (R ^  X|  R j  ) 

with  equality  if  Y^K  -Y -k  . 

Proof : 

RkX|  (R.  | X |  R j  ) 

-{t|  t  £  (R.  | X |  Rj  )  6  3  S0£Rk  s.t. 

*0  1  '  J) 

»{t|  3  3(€  R^j  ,  €  R  j  »8(6  R|j  S.t. 

S,  -t[Rt  ]  t  s2  “t[R j  ]  6 

»,  I 3  -  t[ Yf|t  ]  -  Sc  [Y.m  ] 

Sa  [Yjk  3  -  t[Y.K  J  -  So  [YJk  ]}. 


Also 
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(R  K  X|  R.)  |X|  (R  K  X|  *.  ) 

-{ 1 1  a  Sj  €  R*  X|R.W  ,s4«  Rk  X|Rj  s.t. 

S,  -t[R.  ]  &  s4  *t [R j  ]  } 

■{t|gs3«Rt  »s4«Rj  »Sy  ,  s  ^  4  R^  s.t. 
s3  -t[Ri  ]  &  S<f  «t[Rj  ]  & 

1  ’  ‘t**,  1  -  ss  tT{K  ] 

*4tTjK  3  -  ^jK  1  -  *t  tV  11 

So  we  have 

RKX|  (R.  |X|  a.  )  £  (RK  X|  r j )  |X|  (rk  x|  r^  ). 

Note  that  equality  holds  when  Y  “Y^  .  Similarily,  we 
approximate  the  size  of  R*X|  <R  .  |X|  R  .  )  by  the  size  of 

(R^  X|  R£)  |X|  (R„  X|  Rj  ). 

Since  the  reducibilities  between  R^X| R •  and  R^XfRj  are 
as  follow: 

[  t  W 1- ^  [  t-1  J )  *  <  1- o<c;  [  t-1  ] ) 


RKX|Ri 


-»■ 


1-  fol 


The  size  of  (R^XIR.)  |X|  (R^XIRy)  will  be 

v.[t]*vj[t]*(l-^[t])*(l-^i[t])*(l-^[t]) 

-  V.[t-1]  *  (l-ofott-lj)  *  Vj-  [t-1]  *  (1-^Kjit-l]) 
*  [t-ij)  *  <Wjitt-i])  *  . 

If  at  stage  t,  we  have 
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9 

S 

i 


■4 

£ 


i 


V; [t J»Vj [trl j*v*[t-l ] 

*  (1-  oi{j[t-i)) 

*  (l-cfo[t-l]) 

*  U-fo[t-l]) 


By  definition,  the  size  of  R^X | (R^ | X | Rj  )  is 
VJ-[t]*(l-0^j[t]).  SO  Vj  £ 1 1  *  (1-ofytt]) 

-  v.[t-l]  *  (1-^itt-l])  *  v.[t-l]  * 

*  (1-o/c;  [t-1])  *  (l-o<j4-[t-l])  *  U-^y[t-l])  , 

Since  Vj  [t] 

-  Vjtt-l]*  v^lt-1)*  *  (l-rffitt-l)) 

*  d-  ftj  [t-1]) 

implies  .  (l-^tt-l])  *  (1-  Wiytt-1] ) ,  so  the 

semi  join  reducibilities  of  R{  | X |  r  .  with  R^  will  be 

°^j  tt-1]  ♦  o(Hi  [t-1]  -  o(fcj  1 1 ”  1  ]  *  olki  [t-1]. 


We  summarize  the  changing  rules  of  semijoin 
reducibility  of  Rh(t]  to  R^t]  after  join  operation  bLj  as 
follows:  (also  see  figure  2.2) 


0 

»4,t  t-i) 

»<'w.tt-lJ*<K[t-l]-0^[t-i]*^ill[t-i] 

Ck«tt-l)*o4;  [t-1]-  <4,1 1-1]*  <4  [t-1] 
hmtt-1] 


h«i;k-j 
h» j ; k«i 

h*j?)c^i,  j 

^■jr'h^i,  j 
otherwise 


x 
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Next,  we  consider  the  changing  rules  of  join 

reducibilities  after  operation  p^  »bjj  .(see  figure  2.2) 
After  performing  R.  |X|>  R j  (i^e.  b.j  )  at  stage  t-1,  the 
state  at  node  i  is  R.[t]«  R.[t-1].  The  state  at  node  j  is  R- 
[t-1]  |X|Rj  [ t-1] .  eiij  [t]  changes  to  0  and  is  the  same 

as  [t-1  ] .  Assume  [t]  changes  from  ^.[t-1].  We 

illustrate  in  the  following  figure. 

At  stage  t. 


t]-0 


Figure  2.3  reducibilities  after  join  operation  R. |X|>Rj 

If  we  perform  join  b..  again  at  a  future  stage,  the 

resulting  state  at  node  j  is  the  same  because  R. |X| (R. |X|Rj ) 

■  R.  IX I R.  .  Correspondingly,  the  estimated  sizes  of  the 
v  '  1  J 

derived  relations  must  be  the  same. 

That  is  v.[tj*  Vj[t]*  (1-etyt))*  U-o^tt])*  (l-fgtt])  -  v. 
[tj. 

Because  ^ij[t]-0  and  o^tt]-  o^ft-l],  we  have 
v.[t]*  (l-<%tt-l])*  -1 

and  resulting  ^[t]  *  * “  d"  e^£[t-l] ) ) 

-  1-  l/(v{[t-l]*  (1- •fjiEt-l])), 
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if  vj[t-l]*  (1-  W;j[t-1])  f  0. 
If  v^t-1]*  (1-  e^c[t“l] )»0 f  then  we  set  ^£j[t]-l. 
Thus  we  have 


l/(vt[t-l]*(l-o^[t-l])) 


1 


if  v.[t-l]*(l-o$t[t-l])*0. 
if  v.[t-l]*(l-^.ctt-l])-0. 


Next,  we  consider  the  changing  rule  of  join  reducibility 
J.  [t].  We  assume  at  stage  t, 


We  first  look  at  another  way  of  interpreting  join 

reducibility.  Let  7  be  the  set  of  common  attributes  of  R; 

and  Rj  ,  W  be  the  set  of  attributes  in  R  i  -7  and  Z  be  the 

set  of  attributes  in  Rj  -7,  i.e.  R  j  »{7,W)  and  Rj-«{7,Z}. 

After  two  semi join  operations  a  ^ .  and  aj.  ,  we  assume 

M  ■  the  number  of  common  elements  in  7  columns. 

Associated  with  each  common  element  y p  in  7  columns,  let 

N£p«the  number  of  w  in  columns  W  for  yp  in  , 

Njp»the  number  of  z  in  columns  Z  for  y?  in  Rj  . 

M 

Thus,  the  size  of  R-k  ,  v.  ■  S  N and  the  size  of  Rj  ,  Vj 
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*  N  .  if  we  join  R*  with  R;  ,  R»  I XIR*  ,  the  actual 

p-i  r  J  1  j  '  '  j 

size  of  the  relation  R.  I X 1 R •  will  be  Zl  *N»*  «  By  the 

1  J  fm  i  Jr 

definition  of  join  reducibility ,  the  size  of  R^ | X | R j  is 

<  £  ”{r>*(  £,  Nir»*(1-  Po  >• 

Thus,  we  have 

Py  *  <  £  Nif  >/<<  I;  "tf  )*<p|  Njp  >). 

If  N  -p  and  Njp  »N.£  for  all  p,  q,  then 

1-  pcj  «  1/M 

*  1/number  of  common  elements  in  Y  columns. 


Suppose  R  jj  is  another  relation  with  attributes  {Y,U}, 
i.e.  Y  •  j  *Yjj<  *Yijc  *Yr  and  has  the  same  set  of  common 
elements  in  columns  Y  as  of  Rj  and  Rj  .  After  performing  all 
possible  semi join  operations,  Rj,  Rj  and  R „  will  have  the 
same  M  common  elements  remain  in  columns  Y.  Let 


NKp*the  number  of  u  in  colimns  U  for  yf  in  RK  . 

M 

The  size  of  R  ^  will  be  Zj  NKp  .  If  we  join  R ^  |X|R^  |X|R( 

,  the  actual  size  of  R ^  |X|Rj  | X | R ^  will  be 

£  N  *r  *  H j,  *  Npp  .  U> 

If  we  join  R.  |X|R-  at  node  j,  the  size  of  R.  j X | R •  is 
M  J  1 

*Njp  .  Let  the  join  reducibility  of  R  ^  |X|Rj  with  R^  be 

Pjn  *  **  we  Ri  lx|Rj  *****  R|c  at  node  k,  by  definition 

we  have  the  size  of  R  .  IXIR?  |X|R„  be 
M  fc  M  J  * 


< "if  *  Nir  >*<  &  V*(1'  ft*  > 


Formulas  (1)  and  (2)  should  be  equal.  Thus,  we  have 

l-  ft;  ■  <  £  Nif*NJP*N**r)/((  £  "Cf *»;,)*(=  N„p  )). 

■  <  &  2  %)/ 
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<<  ft  Nff  >*>• 

If  N  .  *N^  ,  N  .p  *Nj4  ,  and  N^p  «N  ^  for  all  p  and  q,  then 

l-  fa 

<<$  >‘‘  ft 

■  <  $,  M  Hir  *^>/ 

((  £  Nip)*(  £  N^)*(2  2:  Nk-)). 

p»l  *T  pal  “  p*>  <Jr  pal  " 

*  M  *  (1-  (*C*  )  *  (1-  ). 

In  general,  we  let  M  *  mi£  {  1/(1-  Pap  )},  i.e.  the 
smallest  number  of  common  elements  in  joining  columns  of  all 


pair  of  relations  R  p  and  R^  .  Thus,  we  approximate  the  join 
reducibility  of  R-  | X ( R j  with  R^  by 

1-  M  *  (1-  fa)  *  (1-  ft*). 


Now  we  back  to  the  changing  rule  of  join  reducibility 
l:*[t]  after  semi  join  operation  a..  .  Following  the  assumption 

ja  * 

of  7  •  j  «Yjk  ,  if  we  want  to  still  have  t]  defined  by 
pairwise  formula 

then  1-  0,*|([t]  will  still  be  the  reciprocal  of  the  distinct 
elements  in  the  joining  columns,  and  after  a- 

1”  fjK  Ct J  «(1-  [t-l])/(l-  [t-1]).  Thus,  we 

approximate  the  join  reducibility  of  R«  X|Rj  with  R^  by 

-  «•*{  l-U-  (ktt-in/U-  Ui]  0). 

n 

In  the  case  of  n  relations  {R  •  }  ,  for  which  the 

semi  join  and  join  reducibilities  are  and  the 

number  of  rows  of  relation  R  »  is  v.  ,  by  the  above 

L  ^ 
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reducibility  changing  rules,  we  have  the  following  results. 

Lemma:  For  relations  R(  |X|R  a |X| . . . . |X|R p  at  node  p  and  R 
at  node  q,  let  ^  ^  and  be  their 

corresponding  semijoin  and  join  reducibilities.  They 
satisfy  the  following  formulas. 

(1)  l-  cfpl  -  7T  (1-  °Chj  > 

(2)  1-  ) 

(3)  1-  -  ««■-'>  .  £  (1-  |?M). 


R,|X|Ra|X|  ...  | X | Rp  | 


Proof:  Me  prove  this  lemma  by  induction  on  p. 

For  p*2,  by  the  assumption  of  semijoin  and  join 
reducibility  changing  rules,  we  have  •  4 


and  of  R  j 

1 X  |  R 

X 

1-  gJ  "  » 

TT 

h»l 

■ 

$ 

1 

H 

& 

l-  fa  • 

M  * 

r»i  k.i  rn* 

for  g  f  1,2. 

By  the  induction  assumption,  we  have  the  semijoin 
and  join  reducibilities  j  "  ,  and 

*N,i  rM,i 

of  R j  | X | RjJ X |  ...  | X ) Rp.,  with  R^  satisfy 

4*  *  St  (1-  > 

m  ft  W-fih  > 

i-  *  £  Cl-  fa  ) 


for  q  f  1,2 


For  the  case  of  p, 


R,|X|R*|X| 


Let  the  semi  join  and  join  reducibilities  of  R,  |X|R 
|X|  ...|X|RM  with  R  p  be  f  ^  and  ^ 
and  the  semijoin  and  join  reducibilities  of  R(  |X|R 
|X|  . . .  | X | R  ^  with  R^  be  ,  e^'p.|  an<^ 

.  By  the  induction  assumption,  they  satisfy 


S  (1-  •'<*  > 


*-  Vh  *  s  (1- j4fc  > 

^  U*.-  mM  *  (1-  ^1» 

and 

1_  °4m,V  ‘  *£  u"  *^r  > 

“  •  g  (1‘  Vk  » 

ff/fH  *  *  Z  (1‘  ’ 

for  q  f  1 f  2  ^ • i p* 1 • 

Also,  we  know  the  semijoin  and  join  reducibilities 
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of  Rp  and  R^be  ,  e^p  and  .  If  we  perform 


join  operation  b, 


P-i,p  ' 


x.e. 


(R,  |X|Ra  |X| 


I X I R^, 


) | X 1 >  Rp  ,  then  by  the  assumption  of  reducibility 


chang 

ing 

rules, 

we 

have  the 

semi  join 

and  join 

reduc 

ibilities 

,  and  a" 

*1?  “pi. 

of  R, | X | 

|X|  • 

•  •  1- 

*lRp 

at 

node 

p  and  R^ 

at  node  q 

satisfy 

1- 

•4t 

■ 

(1- 

m 

>*(1' 

•to  > 

m 

f r 

h*l 

(1-  o/hl  ) 

1- 

m 

(1- 

O 

«£m  ,*<1- 

tor  > 

m 

t 

(1-  ofjj,  ) 

1- 

ft* 

m 

M  * 

<;•  &« » 

*U-  ) 

• 

m 

m 

M  * 

mm 

mm  *  ft  <1 

p  h»i 

*  (1- 
b*i 

- 

t  .1  > 

•ft*> 

for  q 

^  1,2,..., 

p. 

The  lemma  follows. 

Lemma:  The  estimated  size  of  relation  R  ^  |X|R^  |X|  ...  |X|Rn 
is 

n  n  h. 

M  1  *  B  %  *  JE,  <x-  ?»»• 

hCK 

Proof:  We  prove  this  lemma  by  induction  on  n. 

For  n«2,  by  the  definition,  the  size  of  relation  . R ^ 
jXJR^  is 

vl*vi*(l-04|X)*(l-  >M1-  ^). 

By  the  induction  assumption,  we  have  the  size  of  R, 
1*1^  1*1  •••  I  *  I  Rm-|  be 
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>H1  n-i  n-i 

m  *  n7,vh  fa>- 

h*fc  H<K 

For  the  case  of  nf  by  the  above  lemma  and  induction 
assumption,  we  have  the  size  of  R^  |X|Ra  |X|  ... 
|X|R„.|  be 

^  h-l  n-i 

M  *  ff,  vh  *  &  (1‘  **  >*  tf*  (1'  fa  > ' 

h*K  K«K 

and  the  size  of  R  n  be  .  The  semi  join  and  join 
reducibilities  ^  ^  end  ^  of  R,  |X|B, 

|  X  |  .  ..|X|R^  at  node  n-1  and  Rn  at  node  n  satisfy 

*-  -v*  ■  si; (i-  ^ ) 

1-  *  5?  (1-  -<nk> 

n*»  n-i 

l'  Pn.'n..-  MM  *  Za'  fa  > 

Thus,  if  we  join  R(  |X|Rx  |X|  ...  |X|RhHwith  Rh  , 
the  size  of  the  resulting  relation  R  (  JX|Ra  |X|  ... 
|X|Rn  will  be 

hh  n-i  n-i 

•** *  £  vi,  *  J-i  <J-  »<•.»  >*  fa  > 

h#K  h<K 

*  vn  *  >  *  «*-  )  *  (1-  fa,  ) 

teaaa  *  «  « 

■ M  x  Wh*  >*  jl,  (i-  fa  >• 

The  lemma  follows.  ***  *1<K 


we  summarize  the  changing  rules  of  join  reducibilities 
of  R^[t]  to  R^[tJ  after  operation  p^  »b{.  as  follows: 

^[t]-  V  k+i,j 


ftjttj-fl-  V(vc[t-l]*(l-^[t-l]))  if  v.[t-l]*(l-^lt-l 
{  1  if  v.  [t-l]*(l-<^[t-l 


3  )-0, 
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^[t otherwise 

v  h,lt 

Theorem: 

0Se4,y[t]Sl  and  0<  ^ky[t]<l,  for  all  i  and  j  and  any 
time  t. 

Proof : 

If  0Sa*l  and  0<b$l  then  0$ (a-1) (b-l)sl  implies 
QSa+b-a  •  b^l.  We  prove  this  theorem  by  induction  on 
t.  At  t«0,  the  initial  values  ©^[O]  and  |J,y[0]  are 

set  to  between  0  and  1.  By  the  induction  assumption 

at  t-1,  we  have  0*  i(^[t-l]s  1  and  fyi t-l]S  1. 

At  stage  t,  if  ©^[t]  is  equal  to  0  or  o4j[t-l],  o.r 
p;j[t]  is  equal  to  1  or  jS^[t-l],  then  they  still 
between  0  and  1. 

If  ctcj  [t]-  oLi)  C t-1  ]  ♦  o^CitCt-1]  -  * 

of^Ct-l]  then  by  the  induction  assumption  we  have  OS 

o^-[t-l]S  1  and  0S«^[t-lJS  1, 

which  implies  OS  ^4ylt]<  1. 

For  fci<[t-l])*(l-  fjic  [t-1] ) ,  because 

M-min{l-(l- ^w[0])}  implies  l/M-max{  (1-  ^[0] }  ,  thus 
we  have  M*(l-  fff[t-l])Sl.  This  implies  0<  M*(l- 
(^[t-l])<lr  i.e.  Os [t]^l. 

If  v.  [t-l]*(l-^.i{t-l])^  0,  then  v.  [t-l]*(l- rf;f[t-l] ) 
is  greater  than  or  equal  to  1. 

Thus,  Pyl t]»  1-  l/(v.  [t-l]*(l-a/.[t-l]) )  is  between 


72 


0  an  1.  These  prove  0£o^j[tj£l  and  0$  p -[t ]<1 .  The 
theorem  follows. 

We  note  to  readers  that  for  the  case  of  three  relations 
R , ,  Ra  and  R3  having  the  same  joining  columns,  i.e.  Y*Y|Z  »Y(J 
■y*3  »  &ndL  aJ(  a(i  aj4  will  actually  produce  the  same 

results  at  node  2.  But,  under  this  model,  the  estimated 
sizes  of  that  result  by  following  these  two  strategies  are 
different.  In  general,  the  strategy  aJ(  a|Zaja  is  more  time 
comsuming  than  the  strategy  aJ(  a(Jandjjsually  we  do  not  use 
it. 


Since  the  semijoin  of  R.X|>  R*  requires  the  projection 
of  R-over  Y..  and  sending  the  result  to  to  node  j,  the 
projection  of  R.  over  R.fl  Rj  may  reduce  the  number  of  rows 
of  R-by  eliminating  multiple  copies  of  tuples  that  are  the 
same  over  R^ARj .  We  want  to  estimate  the  number  of  tuples 
of  a  relation  after  projection. 

Let  W  be  the  set  of  attributes  in  Rj-Yjj  and  Z  be  the 
set  of  attributes  in  R.-Y^  ,  i.e.  R ^  «{  W,  Yfc.j  }  and  R^  -{  Y%.. 
,Zj. 

Let  N(YW)»  number  of  w  values  in  R<  [W]  per  y  and 
N(YZ)  ■  number  of  z  values  in  Rj  [Z]  per  y. 

N(l)«number  of  y  in  R£[Y{j]  . 

N  ( 2 )  ^number  of  y  in  R j  [  Y  %*j  ]  • 

W(0)«number  of  common  y  in  Rt[Y£j  ]  and  Rj[Y£j  ]• 

Thus  vt  ■  N(l)  *  H(YW)  ,  Vj  -  N(2)  *  N(YZ). 
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After  semijoin  R.  X|Rj  and  R.  X|Rt  ,  then 
vc  -  N(0)  *  N(YW)  «  vc  *  (1  -otji)  and 
«  N(0)  *  N(YZ)  -  vj  *  (l-«*jj). 

The  size  of  R.  |X|Ry  is  N(Join)-  N(0)  *  N(YW)  *  N(YZ). 
According  to  the  definition  of  reducibilities,  we  have 
N( Join)  *  vt*  Vj*  (1  -etij)*  U-  fij) 

-  N ( 1 )  *  N{YW)  *  N(2)  *  N( YZ) 

*  (1-Wcj)*  (1-  fij) 

«  N( 0 )  *  N(YW)  *  N  ( 0 )  *  N ( YZ )  *  (1-  fy  ) 

«  N(0)  *  N(YW)  *  N ( YZ) 

This  implies  N(0)  ■  1  /(1-^cj). 

Since  N( 0 )  *  N(YW)  -  N(l)  *  N(YW)  *  (1-^-Jr 
N(l)-  1  /((l-^cj)  *  (1-  fil  )). 

Similarly,  N(2)-  1  /((l -*4y  )  *  (1-  fij  )). 

Thus,  we  will  estimate  the  number  of  rows  after  projection 


of  R  •  over  Y;y  by 

1  /( (1-  tiij)  *  (1-  fij  )). 

If  d(j  *1 1  then  R^  and  Rj  do  not  have  common  values  in  the 
joining  column  of  R  and  R  .  then  ciji  ■  fij  *1.  In  fact. 


either  one  of  cCy  ,  oCj-  or  equal  to  1  will  implies  another 
two  also  equal  to  one.  If  one  of  them  is  equal  to  1  then  the 


result  is  empty. 


K 
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Example : 

Suppose  ve  have  three  relations  R(  ,R2  and  R ^  , 
where  R .  A  Rj»<^  and  R-tRiV  i,jj  and  suppose  v^fO], 
c<cj [ 0 ] ,  pt-[ 0 ]  are  given.  Then  the  processing  graph 
will  be: 


Let  P( -a3ibl3b3>  and  Pa«  aJaa)3  b3|  ba|  be  two  programs. 
Both  two  programs  P,  and  Pa  will  produce  the  same 
results  R,  [0]  |X|  Ra[0]  |X|  R3[0]  at  site  1.  (see 

Figure  2.4)  By  the  rules  of  estimating  the  size  of 
the  derived  relation,  the  estimate  sizes  of  R( [0] 

| X |  R_[0]  | X |  Rj[ 0 ]  derived  by  these  two  programs 

*  *  JL 

will  be  the  same.  Which  is  M  *JTv[0]  *  .TT  (i- 
i  *&} 

COD  *7T.,(1-  P*jC°l)  *  «here  M-min{l-(l-  pfl[0])}. 

A<j  * 


After  performing  ajz 
v([l]-v,[0] 

Ya  0 ] ) 

Vjtll-VjIO] 


iia  «nA  —  ■  ^ 


efofi3*efa[03 

<4itl3*^ait0] 

o<atl3-o<iitO] 

o<Mtl3-»4(t03*e4,I03-«4,t03*»<sJ0] 

p,»tl3-  l-  (l-  ^i»t03)/U-»4»[01) 

(?(;[!]*  ^j[°3  otherwise 

After  performing  bi3 

v,[2]«v,[l] 

vit23*vj[i3 

v3[23-vJ[l3*vi[l3*U-o<1J[l3)*a-04Jtl3)*(i-  fa [13 
-»3  [01‘v^COl‘d-  ^}1[0J)*(1-  o(,,tO])*(l-  £,[01 

o^m-o 
o4}l23-0 
o^[23-  *«[13 
0<a,[2]-^,tl3 

«<ut23-o(,|[0]*  o4,at03-  ^„(03‘  e<  JO] 

<£|t  J3*  O^toi*  *^mI03- O<»lt03‘  e/^oj 

P»j[23-  1-  1/v*  113 

fttI23-  M1)-  p.»t°3 

^,,[2).  1-  M*(l-  (»,j[0])*(l-yl?<J[03) 

After  performing  bj. 

*lI33*»,t2] 
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v3  C3]*Vj  [2] 

M3]-v|I2]*vJt2]*(l-o/,Jt2])*a-*4/t2))*U-/»J/[2]) 


i  2,5 

-  M*TTv.[0].  .TT  (1-^tOD*  XU- 

*V**I  J  */J*l  l  J 


■a  a  .  b 

/i  u  3| 


After  performing  at4 

v,  [l]-v,  [0] 

vA[l]»va[0]*(l-^/2[0]) 
v3  [l]»v3  [0] 
o^[l]«0 
^a,[l]«o4,[0] 

^/3[1]-o43C0] 
oig/  [1]* 

^[1]»  ^[OlMalO]-  o45C0]*  *U0] 

Paj[l]«  1-  <1-(5»3[0])/(1-^,x[0]) 
(W0!  otherwise 

After  performing  a(^ 

v|[2]-v,[0] 

va[2}-va[0]*<l-o<u[0]) 

*3  I2]’*jt0]*U-^3[0]) 

W„.t2].0 

oi,j[2]-0 

«<»,  [21"<4|t0] 

JlltSl-o'jltO] 


0^.1 2)“  o4j[0]+  o^a  [0]-  o/u[0]*  o4*[0] 

^,[2]-  1-  (1-  (U[13)/(W„[0J) 

■  I"  (1-  P*j[0J)/(W,3[0])*(1-*4*[0]) 
pKn[2]»  ph|l[03  otherwise 

After  performing 

vl  1 3 ] -v,  £0]*vJ[0]*(l-e^J[0])*(l-o/tl[0])*(l-  |S„[0]) 

''at3}-»i[2] 

v3[3].»3t2] 


-<1j[3]*w'|,[2]-0 

»<«t3]»^j[21 

•^131-^,12] 

<4|  [3]*(<1|[0]*»<»J[0J-  o4,[0]*  .{,[01 

=<,xt3l-  *u.[2l  +  4,»[3]-  ^[2]*  Jm[2] 


^,[3].  1-  1/Vj  [2] 

^,t3]-(3„t2]-  (^,[0] 

(W3J-  l-M*(l-p,[2])*U-  fto[2J) 


After  performing  b^» 


VA 1 4  ]  -va  [  3  ] 
V3  [  4  ]  ■  Vj  [  3  ] 


v,  [43-v,[3]*v^[3]*U-^a[3])*(l-^|[3])*U-  ^[3]) 

■  fa*  U-ol$lOW  fa  u-fyto]) 

A+J  f<i 
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In  practical  implementation,  we  can  initially  set«/t- 


and  ptj  for  each  pair  of  relations  by  some  number  which  we 
can  intuitively  guess  at  the  time  that  the  databases  are 
implemented.  At  each  node,  we  set  some  mechanisms  to  record 
the  statistical  information  of  semijoin  reducibilities  and 
join  reducibilities.  After  the  systems  are  running,  we  can 
update  the  number  of  tuples  in  each  relation  and  the 
reducibilities  of  each  pair  of  relations. 


In  processing  a  given  query,  we  perform  initial  local 
processing  (including  projection  and  selection  operations) 
and  attribute  renaming  process  first.  Assume  the  size  of 
relations  {v^  5^,  »  reducibilities  {  fiy  1  and  the 

number  of  tuples  of  relations  associated  in  solving  the 
given  query  after  local  processing,  v^- [0],  for  relation  R; 
are  known.  The  reducibilities  {  ^[0]}  of  the 

resulting  relations  associated  with  this  query  will  depend 
on  the  local  processing.  In  the  following,  we  will  derive 
formulas  for  ^*[0]  and  p<)[0]. 

Let  W  be  the  set  of  attributes  in  R  •  -Y.:  and  Z  be  the 

A 

set  of  attributes  in  Ry  -Y*j,i.e.  R  ^  ■{  W,  Y^j  and  R  .  »{  Y {. 
,Z) .  Let 

N(YW)»  number  of  w  values  in  [W]  per  y; 

N(YZ )  ■  number  of  z  values  in  Rj  [Z]  per  y; 

N(A)«number  of  common  y  in  R^IY^j  ]  and  R  >  [Y  ^  ]  before 
local  processing; 
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N(B)«number  of  common  y  in  R^-[Y  t-j  ]  and  R.  [Yt*j  ]  after  local 
processing. 

We  illustrated  as  in  the  following  figure: 

Before  local  processng 


After  local  processing 


v,[0] 


v  j  I  o  ] 


We  assume  the  number  of  common  elements  of  the  joining 
domain  being  reduced  is  proportional  to  the  size  of 
relations  being  reduced, 

i.e.  N(B)  -N(A)  *  (v{[0]/v.  )  *  (vj [0]/w}  ). 

So  N(B)  *  N(YZ) 

-  N(A)  *  (vJOJ/v,  )  *  (v;[0]/v}  )  *  H(YZ) 

-  vj  *  (1-  )  *  (vc[0]/vc  )  *  (VjtOJ/v.  ) 

-  v.  [0]  *  (1-  ot;j[ 0 ] ) . 

This  implies  (1-  (VitOJ/v.  )  ■  1- 
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So  we  have  o^j[0]  »  1  -(1-atcj)  *(v  ■  [0]/v£  ). 

Similarily ,  we  have  o^;[0]  -  1  -(1- *(vj  [0 ]/Vj  ). 

Because  N(B)  *  N(YW)  *  N(YZ) 

-  N(A)  *  (V;  [0]/vc  )  *  (v.  [0]/Vj  )  *  N(YW)  *  N(  YZ) 

,  v<i  *  Vj  *  n-o/ij)  *  U-  JjC  )  *  (1  -fa) 

*  (Vi  [0]/vt  )  *  (Vj  [0]/Vj  ) 

-  VJO]  *  Vj[0]  *  <l-o<ij[0])  *  d-^[0])  *  (l-ft[0J) 

-  v.[0]  *  Vj  [0]  *  (1-  0 (ij  )  *  (V.  [03/v.  )*  ) 

*  (vj  [0]/v.  )*  (1-  fy[0J) 


This  implies 

(1-  fij[0])  *  (V;  [0]/V.  )  *  (Vj  [  0  ]  /V  j  ). 

That  is  fij[0]-  fcj  ♦  (1-  fij)*  (1-  ( v  j /v  ; [ 0  J ) * ( vy  /Vj  [ 0 ] ) ) . 

Me  choose  p(j[ 03  to  be  max{  0,  +  (1-  )*  (1-  (v;  /vc 

[0] )*(v j  /vj  [03))}.  The  set  of  numbers  {  v.[0),  o4j[0), 

C 0 J 5  are  the  initial  values  for  analyzing  this  given  query. 


82 


2.4.4  Problem  Formulation 

In  order  to  write  down  the  mathematical  formulation  of 
the  distributed  query  optimization  problem,  we  need  to  know 
the  cost  function  of  each  operation.  From  our  previous 
assumption  of  a  linear  cost  function,  we  can  write  down  the 
cost  function  of  operations  at  stage  t. 

The  cost  of  operation  a  .j  will  be 

Cost  (a  ..  )«c>  *(  vv>.  *  Z  w  ( A) ) 

*0  Y‘J  •  MY, j 

and  the  cost  of  operation  b >•  will  be 

Cost(b.*  )«c  .,*(  v. [ t ]  *  2j  w(A) ) . 

J  A*lc 

Based  on  the  distributed  query  processing  model  we 
developed,  the  formulation  of  the  distributed  query 
optimization  problem  is  as  follow: 

INPUT: 

1.  a  distributed  database  schema  D«{  Rt [0] , . . . ,Rn[0] } 

2.  the  width  w(A)  of  each  attribute  A  in  U(D) 

3.  the  number  of  rows  ,^[0],  of  each  relation 

4.  the  semijoin  reducibility  o kj  [0]  of  each  pair  of 
relations  R*  &  Rjwith  o/t^[0]-0 

5.  the  join  reducibility  jJ.j  [0j  of  each  pair  of 

relations  R;  &  Rjwith  jk;[0]»0  and  j5<j[0]- 
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OBJECTIVES 

Find  an  optimal  join-semi  join  program  to  solve  the 
natural  join  program. 

Let  then  the  problem  can  be  written  in  the 

form: 

a 

Min  23  cost(px) 

P  r 

s.t.  D[t]-f  ,  (  D[t-1]  ) 

A[t]-fa  (  A[t-1],  B[t-1] ) 

B[t ]*f 3  (  A[t-1],  B[ t-1 ] ) 
v[t]-f+  (  v[t-l],  A[ t~l ] ,  B[t-1]  ) 

and 

vtO],  A[ 0 ] ?  B[0]  are  given. 

Where  A[0]-  [  oiij  [0]  ]  and 
B[0]-  [  [0]  ]  are 

the  initial  reducibility  matrices. 
v[0]«[  v.  [0]  ]  is 

the  initial  size  of  relation  D  . 
f|(D[t-l])  is  the  mapping  from  temporary  database 
state  at  stage  t-1  by  the  operation  p^.  to  a  new 
temporary  database  state  at  stage  t.  During  the 
analysis  of  a  query  solution  strategy,  the  database 
state  is  conceptually  considered  changing  from  one 
state  to  the  other  state  by  an  operation.  The  real 
database  state  stored  in  the  system  does  not  change 


84 


9 

>  ' 

►  t 
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unless  update  operations  are  really  performed. 

and  tx ,f3  and  f^,  are  mappings  according  the  definition 

of  reducibilities  and  its  changing  rules. 


If  p^ -a.^  ,  then 
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B[t] 


filial 
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2.5  Extend  The  Model  to  Include  Local  Processing  Cost 

The  underlying  assumption  of  this  chapter  is  that  of 
communication  cost  dominance.  This  is  usually  true  in  the 
case  of  a  large  scale  communication  network.  When  databases 
are  distributed  in  a  local  area  network  environment,  the 
local  processing  costs  also  play  a  significant  role  in  query 
processing  because  selection,  projection  and  join  operations 
sometime  take  a  significant  processing  time  to  process  the 
operations.  In  the  case  where  local  processing  costs  are 
comparable  to  communication  costs,  this  model  can  be  easily 
extended  to  cover  the  situation  by  providing  a  method  for 
estimating  the  amount  of  processing  time  required  and 
associating  each  node  with  a  local  processing  cost.  A  method 
of  estimating  the  local  processing  cost  of  selection, 
projection  and  join  operations  was  studied  by  [SEL  79]  and 
[RIM  80],  etc.  This  cost  depends  on  the  method  of 
implementing  join  operation  and  the  available  main-memory 
buffer  space. 

2.6  Conclusions 

In  this  chapter,  we  considered  the  query  processing 
problem  in  a  distributed  relational  database  envirnment,  and 
we  extended  previous  work  to  consider  a  larger  class  of 
solution  strategies  for  equi-join  query  processing. 
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We  have  developed  a  mathematical  model  to  compute  the 
minimum  communication  cost  of  a  join-semi  join  program  for 
processing  a  given  equi-join  query.  We  defined  a  query 
processing  graph  for  each  equi-join  query  and  characterize 
the  set  of  join-semi join  programs  which  solve  this  query.  A 
rule  for  estimating  the  size  of  the  derived  relation  is 
assumed.  With  the  assumption  of  communication  cost 
dominance,  when  the  cost  functions  are  linear  in  the  size  of 
data  transmission,  an  optimization  problem  for  distributed 
query  processing  is  formulated  and  solved.  This  model  can 
be  extended  to  the  case  where  local  processing  costs  are 
significant  and  nonnegligible  by  associating  each  node  with 
a  local  processing  cost  and  providing  a  method  for 
estimating  local  processing  cost. 

Although  the  model  is  based  on  processing  the  class  of 
equi-join  queries  which  is  a  subset  of  complete  relatioanl 
calculus  language,  it  is  a  rich  and  large  class  of  queries 
in  practice. 

In  a  general  query  processing,  we  can  divide  the 
clauses  in  the  qualification  of  a  query  into  two  sets:  The 
set  of  equality  clauses  and  the  set  of  inequality  clauses. 
Usually  the  set  of  inequality  clauses  is  very  small.  We  can 
either  process  the  set  of  inequality  clauses  by  using  the 
inequality  joins  and  inequality  semijoins  first  and  leave 
the  remaining  equi-join  query  solving  by  using  the  model  or 
vice  versa.  The  other  approach  is  to  change  all  inequality 
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join  clauses  into  equality  join  clauses  by  referencing  the 
domains  of  join  attributes. 

A  future  research  topic  is  to  extend  this  model  to 
cover  a  large  class  of  inequality  join  queries  by  providing 
a  method  to  measure  the  reducibilities  and  to.  estimate  the 
size  of  the  derived  relations. 


91 


Chapter  3 

Computation  Complexity  of  Distributed 
Query  Processing  Problem 


3.1  Introduction 

In  distributed  database  management  systems,  the 
efficiency  of  query  processing  has  a  strong  influence  on  the 
performance  of  the  systems.  Query  processing  in  a 
distributed  system  is  different  from  that  in  a  centralized 
system.  In  a  distributed  system,  data  are  stored  at 
computers  which  are  geographically  separated;  hence  query 
processing  involves  some  local  data  processing  and  the 
necessary  data  transmission  over  communication  links. 
Although  both  types  of  operations  will  introduce  time 
delays,  for  a  large  network  of  databases  the  transmission 
delay  plays  the  major  role  in  the  overall  system 
performance.  Many  researchers  and  system  developers  have 
considered  these  facts  and  have  derived  ways  of  finding  a 
distributed  processing  strategy  for  data  processing  and  data 
transmissions.  It  is  recognized  that  deriving  an  optimal 
distributed  processing  strategy,  in  the  sense  that  some  cost 
is  minimized,  is  a  very  difficult  problem  and  all  algorithms 
that  have  been  proposed  to  date  are  heuristic.  No  one  has 
yet  been  able  to  show  that  finding  an  optimal  query 
processing  strategy  in  distributed  databases  is  an 
intractable  problem. 
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In  the  last  chapter,  under  the  assumption  that  data 
transmission  costs  dominate  tha  local  processing  costs,  we 
developed  a  model  for  solving  an  equi-join  query  by  a 
strategy  employing  a  mixed  sequence  of  joins  and  semi joins. 
The  objective  is  to  minimize  the  total  data  transmission 
costs  for  processing  the  query.  If  we  perform  all  the 
possible  semi join  operations,  as  in  the  strategy  used  in 
SOD-1  [GBWRR  81],  the  remaining  problem  becomes  one  of 
finding  a  routing  strategy  of  sending  required  data  to  the 
site  where  the  query  was  initiated  with  a  minimum  total 
transmission  cost.  In  JDD-l,  the  system  takes  the  simplest 
way  by  sending  all  the  data  to  the  query  initiating  node  and 
processing  at  that  node.  In  this  chapter,  we  will  consider 
the  case  where  all  semijoin  reducibilities  are  equal  to  zero 
and  the  join  reducibilities  are  not  affected  by  join 
operations.  We  call  this  problem  the  query  processing 
problem  (QP) .  For  our  purpose,  we  formulate  the  problem  in 
the  following  way. 


Query  processing  problem  (QP): 


Given  a  complete  directed  graph  G-(V,E);  the  size  of 
data  associated  with  each  node  i,  s*  ;  the  unit 

A* 

communication  cost  of  edge(i,j),  c**  j  and  the  join 
reducibility  associated  with  edge(i,j),  d^  .  Here  d^  has 
the  same  interpretation  as  1-  in  chapter  2.  All  semijoin 
reducibilities  in  chapter  2  are  equal  to  zero.  The  size 
of  data  at  node  j  after  the  arrival  of  data  from  node  i  and 


The  cost  of  this  subtree  is 


s,*c,3  +  sa-c23*+  ( s,-sz-  s3-  du  *^23)*  C34 
4  (  S,‘  Sa- S3  * 4i3*  d2y  s4  ‘^34  )  ’^40 


Figure  3.1  The  cost  of  a  subtree 


the  join  operation  between  them  will  produce  the  size  of 

data  at  node  j  equal  to  s*  s;  dv;  .  Here,  d  •*  ■  d  *  •  for 

*  t  *3  3 

each  pair  of  i,j  .  He  define  a  subtree  of  a  tree  as  the 
usual  meaning  of  subtree  with  all  edges  pointing  to  the 
direction  of  the  root  of  the  subtree.  Figure  3.1 
illustrates  an  example  of  the  cost  of  a  subtree. 

Our  objective  is  to  find  an  inversely  directed  spanning 
tree  toward  node  V#with  minimum  communication  costs.  Note 
that  this  model  is  different  from  the  model  stated  in 
chapter  two.  The  model  in  this  chapter  is  not  consistent  in 
estimating  the  sizes  of  the  data  by  two  strategies  which 
will  generate  the  same  results.  For  example,  in  figure  3.1, 
If  we  join  data  in  node  1,  2  and  3  by  joining  data  from  node 
1  to  node  3  and  then  joining  data  from  node  2  to  node  3  will 
result  the  size  be  s,  ‘s^s^d^d^  .  On  the  other  hand,  if  we 
do  it  by  joining  data  from  node  1  to  node  2  and  then  joining 
the  result  from  node  2  to  node  3  will  result  the  size  be  s, 
*sa*s3*d/Jt  *d3j  .  We  will  show  that  under  this  model,  three 
problems  of  finding  a  routing  strategy  of  sending  required 
data  to  the  desired  site  are  NP-complete. 

This  chapter  is  organized  as  follows:  In  the  next 
section,  we  review  complexity  theory  and  list  the  three 
NP-complete  problems  we  need  to  use  for  proving  our  results. 
In  section  3.3  ,  we  prove  the  NP-complete  results  for  the 
query  processing  problem  with  a  bounded .number  of  nodes  in 
each  subtree  (QPBS)  by  using  the  satisfiability  problem 
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(SAT) ,  for  the  query  processing  with  a  bounded  number  of 
subtrees  (QPBT)  by  using  the  bounded  component  spanning 
forest  problem  (BC),  and  for  the  query  processing  problem 
with  maximum  benefit  (QM)  by  using  the  exact  cover  by  3-sets 
problem  (X3C).  The  conclusion  is  given  in  section  3.4. 

3.2  Complexity  Theory 

Recent  developments  in  the  theory  of  computational 
complexity  provide  a  powerful  method  for  comparing  the 
computational  difficulties  of  different  problems.  It  often 
can  provide  information  useful  to  algorithm  designers.  The 
application  of  this  theory  to  combinational  problems  has 
aroused  the  interest  of  many  researchers.  The  foundations 
for  the  theory  are  in  the  paper  of  Cook  [Cook  71]  and  of 
Karp  [Karp  72]  who  first  explored  the  relation  between  the 
classes  P  and  NP  of  (language  recognition)  problems  solvable 
by  deterministic  and  non-deterministic  turing  machines, 
respectively,  in  a  number  of  steps  bounded  by  a  polynomial 
in  the  length  of  the  input.  In  this  context,  all  problems 
are  stated  in  terms  of  recognition  problems  which  require 
yes/no  answers.  Por  the  combinatorial  optimization  problem, 
we  transform  it  into  the  problem  of  determining  the 
existence  of  a  solution  with  value  at  most  (or  at  least) 
equal  to  y,  for  some  threshold  y.  The  class  NP  is  very 
extensive.  It  contains  several  classical  problems  ranging 
from  the  satisfiability  problem  of  propositional  calculus  to 
the  traveling  salesman  problem  for  which,  despite  many 
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lengthy  and  intensive  efforts,  no  efficient  algorithm  is 
known  at  present. 

A  problem  is  said  to  be  NP-complete,  if,  intuitively, 
it  is  as  hard  as  any  problem  in  NP.  Proving  that  a  given 
problem  is  NP-complete  typically  requires  two  steps: 

1.  Showing  that  this  problem  is  in  NP  by  describing  an 
efficient  nondeterministic  algorithm  solving  it. 

2.  Shoving  how  a  known  NP-complete  problem  can  be  reduced  to 
the  given  problem  via  a  polynomially  time-bounded 
transformation. 

The  following  theorem  lists  those  NP-complete  problems 
that  will  be  used  in  this  chapter  to  establish 
NP-completeness  of  the  distributed  query  processing  problem. 

Theorem  1:  The  following  problems  are  NP-complete: 

(1)  Satisfiability  problem  in  the  conjunctive  normal  form 
(SAT) 

Given  a  set  {x(,....,xa}  of  variables  and  a  set  c|f...,cm 
of  clauses.  Each  clause  is  the  disjunction  of  literals 
(i.e.  variables  or  negations  of  variables).  He  are  asked 
to  determine  whether  or  not  the  conjunction  of  c cm 
is  satisfiable,  i.e., whether  there  is  an  assignment  of 
the  values  true  and  false  to  each  of  the  variables,  so 
that  each  clause  contains  at  least  one-  true  literal. 
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(2)  Bounded  component  spanning  forest  (BC) 

Given  a  complete  graph  G«(V,B),  and  given  a  nonnegative 
weight  W(v)£Z+  associate  with  each  v£V. 

Assume  positive  integers  K£  |V|  and  B  are  also  given. 
Can  the  vertices  in  V  be  partitioned  into  k  <  K  disjoint 
sets  V,  ,Va , . .  ,VK  so  that  for  1  £  i  k,  the  subgraph  of 
G  induced  by  V-  is  connected  and  the  sum  of  the  weights 
of  the  vertices  in  V-  does  not  exceed  B? 

At 

(3)  Exact  cover  by  3-sets  (X3C) 

Given  a  finite  set  Xx  , 1  with  |X|  >3q  and 

a  collection  C«{  (T4/6i/  — /  1  of  3-element  subsets 

of  X  .  Does  C  contain  an  exact  cover  for  X  ?  i.e.,  Does 
there  exist  a  subcollection  C'&C  so  that  every  element 
of  X  occurs  in  exactly  one  member  of  C'  ? 

Proof : 

(1)  see  reference  [KARP  72] 

(2)  see  reference  [GAJO  79] 

(3)  see  reference  [KARP  72] 

3.3  Computation  Complexity  of  Query  Processing  Problem. 

In  this  section,  we  show  that  three  problems  of  finding 
a  routing  strategy  of  sending  required  data  to  the  desired 
site  are  NP-complete.  The  first  problem  is  the  query 
processing  problem  with  the  constraint  that  the  number  of 
nodes  in  each  subtree  is  bounded.  The  second  problem  is  the 
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query  processing  problem  vith  the  constraint  that  the  number 
of  subtrees  of  the  root  of  the  directed  spanning  tree  is 
bounded.  The  third  problem  consider  the  maximun  benefit  of 
routing  strategy  which  has  nothing  to  do  with  the  minimum 
cost  query  processing  problem. 

3.3.1  Query  Processing  Problem  With  a  Bounded  Number  of 
Nodes  in  Bach  Subtree  (QPBS) 


We  formulate  the  problem  as  follows: 

Assume  we  are  given: 

*  a  complete  graph  G«(V,E) 

*  s. ,  the  size  of  data  associated  with  each  node  v. 

* 


* 


(if  j) 


the  unit  transmission  cost  associated  with  edge 


*  d^  ,  the  reducibility  associated  with  each  pair  of 
nodes  (i,j) 

*  B  is  rational  constant  and  K  is  integer  constant. 

*  v0  ,  the  final  node  (query  answer  node) 


Does  there  exist  a  directed  spanning  tree  toward  v#  so 
that  the  cost  of  the  spanning  tree  is  at  most  B  and  the 
number  of  nodes  in  each  subtree  of  v# is  at  most  K? 


Theorem  2:  QPBS  is  NP-complete 


Proof : 

Consider  the  SAT  problem  in  theorem  1.  We  will  show  that  SAT 
is  reducible  to  QPBS,  i.e.,  that  for  any  instance  of  SAT  an 
instance  of  QPBS  can  be  constructed  in  polynomial-bounded 
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time  so  that  solving  the  instance  of  QPBS  solves  the 
instance  of  SAT  as  well.  The  theorem  then  follows  from  the 
NP-completeness  of  SAT  in  theorem  1  and  the  fact  that  QPBS 
belongs  to  NP,  since  any  feasible  spanning  tree  toward  node 
v0  can  be  recognized  as  such  in  polynomial  time. 

Given  any  instance  of  SAT,  we  write  a  set  of  variables 
as  (x,  ,,..,1^]  and  a  set  of  clauses  as  {c,  ,cx, . . ,cml ,  and 
define  an  instance  of  QPBS  as  follows: 

G-(V,E)  is  a  complete  graph  such  that: 

*  V-{v0  ,  *,»•••  »*,»••  •  »  ci  »"*,  c»»  •  *'* 

*  each  node  is  associated  with  a  size  2,  i.e.  s- *  2  V i 

A* 

*  each  edge  is  associated  with  a  reducibility  ■  1/2 

*  the  cost  of  each  edge  is  defined  as  follows: 

cost  (dj  ,  d,  )■  1  Vj  •  2,***,® 
cost  (d,,  x,)  -  cost  (  d(,  x,)  ■  1 

cost  (  x^,  x^)  -  cost  <x^,  xiH)  -  cost  (  r,  x^  )  » 

cost  (  xi#  xi4l)  •  1 
V  i  »  1,  2,—  ,  n-1. 

cost  (  xrt,  vtf)  »  1,  cost  (  x„ ,  v# )  «  1 

cost  (  c. ,  s';  )  ■  1  if  0*  is  a  literal  in  c.  where  r> 

*  J  J  j 

is  either  x*  or  x.  . 

J  » 

*  all  other  edges  have  high  cost. 

*  B  ■  2  (  2n+2m) 

*  K  ■  n+m 

Pigure  3.2  illustrates  this  reduction. 
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Bach  node  is  associated  with  a  size  2  and  each  edge  has 
a  reducibility  of  1/2  .  Edges  of  infinite  length  are  not 
shown.  The  solution  of  QPBS  corresponding  to  the  truth 
assignment  x(*  false,  x^*  x^-true  is  shown  with  a  heavy 
line. 


We  claim  that  SAT  has  a  solution  if  and  only  if  G 
«(V,E)  contains  a  spanning  tree  towards  v#so  that  the  cost 
of  each  subtree  is  bounded  by  B  and  the  number  of  nodes  in 
each  subtree  of  v0  is  bounded  by  K. 

(  )  Assume  there  is  a  truth  assignment  t  satisfying  SAT. 

Then  the  tree  T  with  da  ,dj  d^through  d,  and  then 

through  those  0-value  literals  of  {x^.  , x^ }  and  then  to  v0  as 
one  subtree,  and  c, ,ca,...,  cm  through  the  1-value  literals 
and  then  to  v0  as  another  subtree  of  v0  will  have  cost  2  ( 

2n+2m  )  and  each  subtree  has  n+m  nodes. 

(<F=r )  Suppose  QPBS  has  a  tree  solution  T.  Note  that  T  must 
have  2n+2m  edges.  Since  B  »2*(2n+2m),  and  the  shortest  edge 
of  G  has  length  1,  it  follows  that  T  must  consist  of  2n+2m 
edges  of  length  1.  Also,  since  there  are  two  edges  of  length 
1  incident  upon  node  v0  and  K  is  half  the  number  of  nodes  in 
G,  T  has  exactly  2  subtrees  rooted  at  node  ve  ,  each  of 
cardinality  (not  including  v0)  n+m.  The  m  nodes  d(  ,dA,...,  d^ 
are  obviously  in  one  such  subtree.  The  only  way  that  these 
nodes  can  be  connected  to  x(  or  7(  is  by  a  path  P  traversing 
one  node  from  each  pair  (x.  ,  S\  ). 


Let  a  literal  have  the  value  false  if  the  corresponding 

node  is  in  P.  Since  d(,da, ...,  dm  together  with  P  exhausts 

the  m+n  vertices  allowed  in  a  subtree,  the  nodes  {  x^.,  i. } 

not  in  P  and  (c^l  must  constitute  the  other  branch.  Also, 

for  each  c;  ,  there  is  a  node  not  in  P,  such  that  c.  is 
J  j 

adjacent  to  it  .  Hence  each  clause  of  SAT  contains  a 
literal  that  is  assigned  the  value  true  by  the  above  truth 
assignment.  Thus  SAT  is  satisfiable. 

3.3.2  Query  Processing  With  a  Bounded  Number  of  Subtrees 
(QPBT) 

We  next  consider  the  objective  to  be  finding  a  directed 
spanning  tree  toward  final  node  v#with  a  minimum  number  of 
subtrees  such  that  the  cost  of  each  subtree  is  bounded  by  B. 
We  prove  that  this  problem  is  still  NP-complete. 

This  problem  is  formulated  as  follows: 

Assume  we  are  given 

*  G  ■  (V,E)  a  complete  graph 

*  S; ,  the  size  of  data  associated  with  each  v. 

*  c..  ,  the  unit  transmission  cost  associated  with 

edge  ( J , j ) 

*  d--  ,  the  reducibility  associated  with  each  pair  of 

J 

nodes  (i,j) 

*  B  and  K,  constants 

*  the  final  node,  v# 

Does  there  exist  a  spanning  tree  toward  node  v  such  that 

o 

the  cost  of  each  subtree  is  bounded  by  B  and  the  number  of 
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the  subtrees  is  no  greater  than  K? 

Theorem  3:  QPBT  is  NP-complete 
proof : 

Note  that  the  proof  of  theorem  2  works  trivially  for  theorem 
3  also.  Here  we  give  another  proof.  Consider  the  BC 
problem  of  Theorem  1.  We  will  show  that  BC  is  reducible  to 
QPBT,  i.e.,  that  for  any  instance  of  BC  an  instance  of  QPBT 
can  be  constructed  in  polynomial-bounded  time  so  that 
solving  the  instance  of  QPBT  solves  the  instance  of  BC  as 
well.  The  theorem  then  follows  from  the  NP-completeness  of 
BC  in  theorem  1  and  the  fact  that  QPBT  belongs  to  NP,  since 
any  feasible  spanning  tree  toward  node  v0  can  be  recognized 
as  such  in  polynomial  time. 

Given  any  instance  of  BC,  we  denote  the  graph  by  Gg  ■ 
< VB ' Eg ) ,  *nd  the  weight  of  each  node  v^6Vg  by  W(v.  )  €  Z*  .  We 
construct  an  instance  of  QPBT  as  follows: 

*  G  *(V,E)  is  a  complete  graph  with  V  ■  V  U  {v  (final  node)} 

*  s .  -1  Vi 

*cCj -W(Vj)  Vi 

*  -  1  V  i ,  j 

*  K  6  B  are  the  same  as  in  BC 

The  subtree  example  following  has  the  communication  cost 
W(v, )  ♦  W(va)  +  W(vj  )  ♦  W(v4). 


Figure  3.3  Illustration  of  reduction  from  BC  to  QPBT 
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We  claim  that  BC  has  a  solution  iff  the  constructed  6 
»(V,B)  contains  a  spanning  tree  toward  v  so  that  the  cost 
of  each  subtree  is  bounded  by  B  and  the  number  of  the 
subtrees  is  bound  by  K. 

(  =^>)  Asstime  that  BC  has  a  solution 

i.e.  the  vertices  in  V  can  be  partitioned  into  k  4  K 
disjoint  sets  V,  vKso  that  for  1(  i£  k,  the  subgraph 

of  6  induced  by  \N  is  connected,  and  the  sum  of  the  weights 
of  the  vertices  in  V;  is  <  B. 

We  construct  a  subtree  T.  for  each  set  V*  and  connect  it  to 

A  Af 

v  and  each  subtree  T.  has  comm,  cost  ■  T*.  s  ■  W(v.  )  £ 

A  £6TC  1  UTc 

B  and  the  number  of  subtrees  kSK. 

(<^=  )  The  reverse  of  the  argument  is  the  same  as  above. 

3.3.3  Query  Processing  Problem  With  Maximum  Benefit  (QM) 

In  the  query  processing  problem,  if  we  associate  each 
edge  with  the  unit  benefit  of  transmitting  data  through  that 
edge  instead  of  with  the  unit  cost,  then  the  problem  becomes 
finding  a  maximum  benefit  spanning  tree  toward  node  vp .  We 
show  that  this  problem  is  still  NP-complete.  Note  that  this 
problem  has  nothing  to  do  with  minimum  cost  query  processing 
problem  because  we  cannot  transform  minimum  cost  QP  problem 
to  this  problem. 


We  formulate  the  problem  as  follows:  • 
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Assume  we  are  given: 

*  G  •  (V,E) 

*  for  each  node  v.  ,  associate  data  size,  s. 

A* 

*  for  each  pair  of  nodes  (i,j),  associate  a  join 
reducibility  d-j  ,  and  a  unit  benefit  by 

*  final  node  v0 

*  constant  B 

Does  there  exist  a  spanning  tree  toward  v0  such  that  the 
total  benefit  is  no  less  than  B  (&  B)? 

Theorem  4:  QM  is  NP-complete 
proof : 

Considering  the  Z3C  problem  in  theorem  1.  We  will  show  that 
x3c  is  reducible  to  QM,  i.e.,  that  for  any  instance  of  Z3C, 
an  instance  of  QM  can  be  constructed  in  polynomial-bound 
time  such  that  solving  the  instance  of  QM  solves  the 
instance  of  Z3C  as  well.  The  theorem  then  follows  from  the 
NP-completeness  of  Z3C  in  theorem  1  and  the  fact  that  QM 
belongs  to  NP,  since  any  feasible  spanning  tree  toward  node 
v  can  be  recognized  as  such  in  polynomial  time. 

Given  any  instance  of  Z3C,  we  write  a  finite  set  of 
elements  Z  a{  *  C*  , ...,  and  a  collection  C  of 

3-element  subsets  of  X:  C  ■  {  <Tj  ,  <j^  , . . . ,  1. 


We  define  an  instance  of  QM  as  follows: 


107 


G  «(V,E)  is  a  complete  graph 

*  v  ■  {  #  <£  » •  •  • »  Of  ,Z  t  *  Zt  ‘  r  Zjj'  v«  1 

*  for  each  node  i ,  s  .  -4 

A 

*  d^-  1/2  Yi,j 

*  b  (  zc ,  rj  >  «  i  if 

b  (  or,  ira)  -  1  j  -  1,2,... ,r 

<# 

b  (  m,  n  )■-©  otherwise 

*  B  ■  40q  +  4r 

Figure  3.4  illustrates  this  reduction. 

We  claim  that  Z3C  has  a  solution  if  and  only  if  G 
*(V,E)  contains  a  spanning  tree  with  the  total  benefit  no 
less  than  B. 

(  ■=P‘)  X3C  has  an  exact  cover  c'£C,  then  the  spanning  tree 
with  edges  (  2T,  ,  <Tj  )  V<FJ€C'  and  (  CT  ,  V«  )  i-  l,2,..r, 
has  benefit 
3q  4  +  32  q  +  4  (r-q) 

*44q  -  4q  ♦  4r 
■40q  +  4r  >  B 

(<^=s  )  Considering  the  spanning  tree  toward  node  v#  in  fig. 
3.4,  there  are  only  three  types  of  first-level  nodes 
directed  toward  the  second-level  nodes:  (1)  the  nodes  in  the 
first  level  direct  toward  one  node  in  the  second  level  and 
leave  one  node  in  the  second  level  without  any  first-level 
nodes  directed  to  it,  (2)  two  second-level*  nodes  each  have 
two  first-level  nodes  directed  to  them,  and  (3)  one 


The  be»ef*  of  J,  is  I2+32.+4MS 


THE  BENEFIT  OF  THESETWO  EDGES 

is  32+4=36 


Tz  •  The  benefit’  of  T2  is  I6-H&+  It  *4^ 


“ :  The  ben«pi  T3  is  i2-Hb  +  <?  =  3& 


THE  BENEFIT  OF  TH£S£TW0  EDGES 
IS  l6+?«24 


Figire  3,5  Three  types  of  spanning  trees  toward  node  v* 
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second-level  node  has  two  first-level  node  directed  to  it 
and  the  other  second  level  node  has  only  one  first-level 
node  directed  to  it.  We  illustrate  the  three  cases  as  in  T,, 
T j,  and  Irrespectively  in  figure  3.5. 

As  ve  can  see  from  figure  3.4  and  figure  3.5,  ve  know 

*  The  benefit  of  the  first  level  in  figure  3.4  is  constant 
12q. 

*  The  maximun  benefit  of  the  second  level  in  figure  3.4  is 

32q  +4  (r  -  q)  which  occur  when  exactly  three  t. 

connect  to  one  <Tf  . 

So  if  there  exists  a  spanning  tree  toward  vff  with  cost 

no  less  than  B  (  A  40q  ♦  4r  ) ,  then  this  must  be  a  spanning 

tree  with  exact  three  z  connect  to  one  <F 

J 

Let 


C'  -  {  (F'  |  exact  three  Z -  connect  to  one  <F 

*  J 

in  the  spanning  tree  } 

Then,  C*  is  the  exact  cover  of  Z3C. 


Following  the  same  proof,  if  we  restrict  d  *•  ■  1/2  for 

J 

all  i,  j  then  the  problem  is  still  NP-complete. 


3.4  Conclusion: 

We  have  proven  that  three  variations  of  the  query 
processing  problem  (QP)  are  NP-complete  problems.  The 
complexity  of  the  query  processing  problem  is  still  open,  it 
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is  a  future  research  problem.  The  difficulty  of  solving 
query  processing  problems  comes  from  the  fact  that  the  cost 
of  each  stage  depends  on  the  cost  of  all  previous  stages. 
That  is,  we  need  to  search  over  all  the  inversely  directed 
spanning  trees  toward  the  final  node  v  and  compute  the  cost 
of  them.  The  solution  space  is  very  large.  The  other 
complexity  comes  from  the  fact  that  at  each  stage  (node),  we 
want  to  choose  the  next  node  to  move  and  process;  we  have 
two  objectives:  one  is  the  minimization  of  the  immediate 
transmission  cost  and  the  other  is  the  maximization  of  the 
data  being  reduced  after  moving  the  data  from  one  node  to 
the  other  and  performing  the  join  operation  between  them. 

Because  of  the  results  of  this  chapter  and  the  dynamic 
programming  problem  nature  of  the  query  processing  problem, 
we  do  not  intend  to  look  for  optimal  algorithms.  Instead, 
our  objective  in  the  next  chapter  is  to  provide  heuristic 
algorithms  for  choosing  routing  strategies  for  processing  a 
given  query. 

Future  research  direction  will  be  determing  the 
complexity  of  query  processing  prblems  in  the  model  of 
chapter  two. 


Ill 


Chapter  4 

Heuristic  Algorithm  for  Distributed  Query  Processing  Problem 

4.1  Introduction 

In  the  last  chapter,  we  have  shown  that  under  three 
different  objective  functions,  the  problem  of  finding  a 
routing  strategy  for  sending  required  data  to  the  site  where 
a  query  is  initiated  is  NP-complete.  The  exact  algorithm  for 
this  problem  must  enumerate  all  possible  solutions  and 
calculate  their  costs  because  of  their  dynamic  network 
nature.  When  the  number  of  nodes  is  small,  the  enumeration 
algor  it'*;?  is  possible  and  hopefully  can  generate  an  answer 
within  a  reasonable  response  time.  Since  many  of  the  queries 
issued  by  users  are  for  real-time  application,  response  time 
is  an  important  factor  for  the  users.  The  algorithm  for  a 
generating  routing  strategy  must  be  very  efficient  in  order 
to  meet  the  requirement.  Heuristic  algorithms  appear  to  be 
the  only  reasonable  option  for  solving  such  problems. 

In  this  chapter,  we  will  analyze  the  problem  and 
provide  heuristic  algorithms  for  the  problem.  He  first 
consider  the  simpler  case  of  the  problem  where  all  possible 
semijoins  are  performed  and  only  consider  the  routing 
strategies  for  join  operations.  He  provide  several  heuristic 
algorithms  for  this  problem.  He  then  extend  the  algorithms 
of  the  simpler  case  to  the  general  distributed  query 
processing  problem  of  solving  an  equi-join  query  by  a 


sequence  of  join-semi join  mixed  operations.  Numerical 
examples  are  given  to  show  how  the  algorithm  work. 

Before  we  discuss  algorithms,  we  shall  introduce 
several  graphical  terminologies  as  used  in  graph  theory.  An 
inversely-directed  spanning  tree  is  a  directed  spanning  tree 
with  all  edges  pointing  in  the  direction  of  the  root  of  the 
tree.  A  branch  of  a  node  is  a  subtree  of  that  node.  Consider 
a  directed  path  (x,  ,x2)  (x2,x3)  ....  ( x„,,,  xK)  passing  through 
nodes  x(  ,x2,...,xK  in  a  graph.  The  length  of  this  path  is 
k,  i.e.,  the  number  of  nodes  in  the  path.  For  i  and  j  so 
that  l<i*j<k,  x;  is  a  successor  of  x.  and  x.  is  a 

J  A 

predecessor  of  Xj  .  If  j*i+lr  we  shall  use  the  terms 
immediate  successor  and  immediate  predecessor,  respectively. 
A  node  with  no  successor  is  a  terminal  node  (or  final  node), 
and  one  with  no  predecessors  is  an  initial  node.  A 
nonterminal  node  is  at  level  k  in  a  graph  if  the  longest 
path  from  it  to  a  terminal  node  is  of  length  k.  The  level  of 
a  terminal  node  is  defined  to  be  1.  A  sequence  of 
join-semi join  operations  is  a  sequence  of  join-semi join 
edges  in  the  query  processing  graph  which  we  defined  in 
chapter  2. 

4.2  Cases  Where  All  Semijoin  Reducibilities  Equal  Zero 

In  this  section,  we  consider  a  simpler  case  which 
assumes  that  all  possible  semi joins  were  performed,  i.e., 
all  semijoin  reducibilities  between  each  pair  of  nodes 


become  zero.  We  also  assume  that  the  join  reducibilities  are 
not  changed  or  affected  by  the  join  operation.  This 
assumption  simplifies  the  problem  by  ignoring  the  possible 
dynamic  changes  of  join  reducibilities  after  join  operation. 

We  describe  the  problem  in  the  following:  [SQP] 


Given  a  complete  directed  graph  G-(V,E)  with  V»{  ,v( 

,  ...,v^  7  and  E»{  (v-  ,vj  )|  V  i  i  j  Assume  vQ  is  the 
answering  node  of  a  query.  For  each  node  v-  ,  we  denote  the 
data  size  of  that  node  by  s^.  With  each  pair  of  nodes  (v. ,Vj 
),  we  associate  a  unit  transmission  cost  c.j  of  that  edge 
and  a  reducibility  d  -  between  relations  in  node  vj  and  Vj  . 


The  reduciblity  d^j  between  relations  in  nodes  v- 


and  vj 


means  that  if  we  send  a  relation  from  node  v*  to  node  vj  and 


perform  a  join  operation  locally  at  node  ,  the  resulting 

data  at  node  will  have  size  s.*  Sj*  d..  .  In  fact,  this  d-j 
has  the  same  interpretation  as  1-  Qij  where  fij  is  the  join 
reducibility  between  relations  in  node  v.  and  vj  .  Figure 
4.1  gives  an  example  of  this  problem.  The  total  transmission 
cost  of  the  solution  strategy  is  computed  by  taking  the  sum 
of  the  subtree  cost  of  each  branch  of  the  tree.  Figure  4.2 
shows  an  example  of  the  transmission  cost  of  the  operation 
between  a  pair  of  nodes.  The  costs  are  labelled  next  to  the 
edges.  Our  objective  is  to  find  a  directed  spanning  tree 
toward  node  V0  with  minimum  total  communication  cost. 


4.2.1  Analysis 


Figure  4,1  Example  of  SOP  problem 


Total  cost  of  this  subtree  is 

S 3-Cu+  c2-czx  +  s,-«s,-s,-d  ar^rci 


Figure  4.2  Example  of  computing  subtree  cost 
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In  this  problem,  we  have  two  types  of  objectives  at 
each  step.  The  first  type  is  to  minimize  the  next  step 
transmission  cost,  i.e.  node  i,  min  (c  c:  }.  The  second 

i  J 

type  concerns  data  reduction  after  this  operation.  It  can 

be  either  finding  the  minimum  of  the  data  required  to 

transmit  after  the  first-step  operation,  i.e.  m\n  {d»  •  s. 

J  J  J 

}  or  finding  the  maximum  of  the  data  being  reduced  after  the 

first-step  operation,  i.e.  max  { (1-d  .? ) • s. } .  In  order  to 

j  J  J 

find  the  next  step  operation,  we  either  can  consider  only 
one  of  the  two  types  of  objectives  or  consider  the  weighted 
combination  of  those  two  objectives.  This  type  of 
heuristic  algorithms  forms  a  solution  in  a  single  pass  by 
selecting  operations  sequentially  in  an  order  that 
minimizes  the  increase  in  the  objective  at  each  step. 
We  call  this  type  of  algorithm  a  greedy  heuristic  because 
of  its  appetite  for  immediate  improvement.  We  will  give  an 
analysis  of  this  problem  in  this  section  in  order  to  provide 
a  constructive  insight  of  the  heuristic  algorithm. 

We  define  a  node  label  p.  for  each  node  to  be 
the  transmission  cost  of  the  next  edge  in  the  solution 
tree.  Initially,  we  assume  all  node  labels  p-»  1.  As  the 
solution  tree  grows,  the  number  of  nodes  in  the  tree 
increases  and  the  node  labels  will  update  accordingly. 
Fig.  4.3  gives  an  example  of  updating  node  labels  of  the 
example  in  fig.  4.2 
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At  any  stage,  if  we  want  a  new  node  to  be  included  in 
the  solution  tree,  we  need  to  compare  the  total  transmission 
costs  of  all  possible  cases.  For  example  in  fig.  4.3,  if 
node  4  is  the  next  node  to  be  included  in  the  solution  tree, 
it  can  be  sent  to  node  0,1,2,  or  3.  We  need  to  compare  all 
four  possible  cases  and  choose  the  one  with  minimum  totrl 
communication  costs.  See  fig.  4.4.  In  general,  if  S 
is  the  set  of  nodes  in  the  solution  tree  at  this  stage  and 
node  k  is  the  node  which  will  be  included  in  the  solution 
tree,  we  need  to  compare  the  costs 


Min 

its 


sk*ck£ 


w 


J  in 


p*.  ) 


HZ  P-1  (4.1) 

j  ncrt  »«p*H>(C,o)  J 

}*> 


and  choose  a  minimum  one.  Assuming  node  r  to  be  such  a 
node,  we  include  node  k  in  the  solution  tree  by  adding 
the  edge  (krr).  The  node  labels  are  changed  as  follows: 

Pi  *  sK*c*r 

p!  ■  sK*dKY(p;)  V  j  in  Pat**  (r»0)  6  j  ♦  0. 

J  Q 

p!  -  p.  otherwise. 

Figure  4.5  gives  the  node  label's  updating  of  Figure  4.4 
where  node  3  is  the  node  which  was  selected. 

Assuming  the  graph  has  node  V,  and  n  other  nodes,  and 
the  number  of  nodes  in  the  solution  tree  S  at  this  stage 
is  m,  the  number  of  operations  at  each  stage  is  then  0(n*m). 


Figure  4.4  Including  a  new  node  into  solution  tree 


Figure  4,5  Updating  node  labels 
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So  the  number  of  computations  of  this  greedy  algorithm 
requires  0(n^  ).  We  will  identify  several  special  cases  in 
which  the  computations  are  reduced. 


Let  I»{i  J  C.  ■  min{Ct;]}  be  the  set  of  nodes  where  the 

*Q  3  J 

minimum  unit  transmission  cost  is  desired  when  sending 

the  query  answering  node  0,  and  J  ■  {i(m^n  {s.*dy}  <  1  }  be 

jio 

the  set  of  nodes  where  there  exists  a  non-final  node  at 
which  data  can  be  reduced  to  a  smaller  size  by  sending  the 
data  from  node  i  and  performing  the  local  processing. 


Given  a  node  i,  we  can  classify  into  one  of  the 
following  four  cases: 


Case  1:  C.0»min  {C;j  ]  and  3k  so  that  s.^-d.^ 
J 

<  1 


min  {s.-d-r  1 

j+0  *  AJ 


Since  the  node  labels  will  change  as  P.  ■  s.* d. 

J  At  AtK 


(P:  )  V  j  in  path  ( k , 0  ),  j  \  0,  if  we  add  the 

J 

edges  (i,k),  and  s^d^  <  1  ,  it  will  reduce  the 


total  communication  cost  of  the  path  (k,0).  We 
need  to  compare  the  tradeoff  between  the 
payment  for  the  first  step  transmission  cost  and  the 
gain  from  the  saving  of  the  path  transmission  cost. 


The  strategy  of  choosing  k  is  such  that  k  is 
the  outermost  node  of  a  branch  with  larger 


and  (1  -  s.*  d 


r,  P> 

jtfpwrhf^o)  9 


. —  P.  ).br  £  s.(c  - 


=«>• 


Case  2:  i^I  and  i€J. 


Let  R..  (k  |  C**  Cj.end  1  }. 

Lemma:  If  K ■ ,  then  «e  never  use  edge  (i,0) 

9* 


Proof : 


Since  Sj*  d>i|t<  1  implies 


jep**(K,o)  PJ  )  *  j«p^?hCK,o)  P. 


and  s.c-*  $  s.  c.„  ,  the  cost  of  sending  data  from 

A  Al'  A  A9 


node  i  to  node  0  will  have  a  higher  communication 
cost. 

If  »<j>,then  we  must  compare  all  options  as  in  the 
general  case. 


Case  3:  i£  I  and  i 

Since  i €  I ,  so  »x*cCj  y  \*CaX>  '  Vj*  and  *  ^  3 
implies  s-*d..  >  1.  The  cost  of  sending  data  directly 

A 


from  i  to  0  will  be  s.*  c._ .  And  the  total  cost 
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will  be 


,2Ii  .  p;  .  (4.3) 

Uo2«j  j 


S.C.  +  . 

a  *0  ofiJLWesj 
i+o 

The  cost  of  sending  data  to  node  k  will  be  s.<  c. 


>cK 


and  the  total  cost  will  be 


s.- c-  +  s.-dVlt  (  p.)  +  P. 

>  aic  *  ak  jepetfcU,©)  J  J 


(4.4) 


J*0 


Formula  4.4  is  always  greater  then  (4.3),  so  we  have 


Lemma:  If  i€l  and  i  ^ J  ,  then  the  minimum  routing 
is  through  edge  (if0). 

Case  4:  i£I  and  i^J. 

In  this  case,  we  need  to  compare  all 
possible  options  to  include  node  i  in  the  solution 
tree.  We  use  formula  (4.1). 

4.2.2  Heuristic  Algorithms 

In  general,  a  heuristic  algorithm  can  be  divided  into 
two  stages.  The  first  stage  is  to  build  a  feasible 
solution.  If  the  algorithm  for  generating  an  initial 
feasible  solution  is  good  enough  to  provide  a  solution 
close  to  the  optimal  solution,  then  the  algorithm  itself  can 
be  used  as  a  heuristic  for  the  problem.  The  second  stage 
is  the  improvement  stage.  At  this  stage,  the  heuristic 
attempts  to  improve  an  initial  feasible  solution  by  some 
method  to  obtain  a  better  solution.  The  most  often  used 
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method  is  the  interchangeable  heuristic  which  attempts  to 
improve  an  initial  feasible  solution  by  interchanging 
some  nodes  in  the  solution  with  some  that  are  not  in  the 
solution.  This  process  continues  until  a  solution  is 
found  that  cannot  be  improved  by  such  interchanges. 

In  this  section,  we  will  seperately  describe  the 
algorithms  for  building  an  initial  feasible  solution 
and  for  solution  improvement.  Any  combination  of 
algorithms  for  those  two  stages  will  be  an  algorithm  for 
[ SQP ] . 

We  will  first  describe  the  algorithms  for  building 

an  initial  feasible  solution. 

According  to  the  analysis  of  section  4.2.1,  we  can 

easily  generate  an  algorithm  to  build  an  initially 
feasible  solution  that  is  an  inversely  directed  spanning 

tree  toward  node  v  ,  gradually  letting  T  be  the 

solution  tree.  Initially  T  is  empty.  We  also  assume  each 
node  i  has  associated  with  it  a  node  label  p  .  Initially 
P  -  1  Vi.  The  algorithm  is  as  follows: 

Algorithm  A: 

(1)  T  •f  initially 

(2)  y  node  t€V  -  {v  |  c^-min  {cyj  1  6  Sj  dJ(c  £  1  yj  }, 
includ  (t,0)  into  the  tree.  i.e.  T  ■  {  (t,0)[ytev 
1 

(3)  Sort  the  remaining  nodes  in  the  increasing  order  of 
the  size  of  the  data  at  each  node.  Assume  that  the 


ord.r  is  «WS  «„„»••••?  »rnr 

(4)  Add  the  nodes  in  the  tree  T  according  to  the 
order  of  this  sequence 

(5)  In  adding  node  v^into  the  tree  T,  choose  a  node  t 
already  in  the  tree  so  that  adding  the  edge  (v(J),t 
)  gives  a  minimum  cost  increase. 

(6)  update  node  labels  according  to  formula  (4.2) 

(7)  stop  when  all  nodes  are  included  in  the  tree. 

This  algorithm  is  a  greedy  heuristic  algorithm  because 
t  forms  a  solution  by  selecting  an  edge  which  minimizes  the 
increase  in  the  objective  at  each  step. 

If  we  consider  only  one  objective,  the  first-step 
transmission  cost,  then  we  can  transfer  this  problem 
into  a  minimum-directed  spanning  tree  problem. 

Algorithm  B: 

(1)  Build  a  complete  directed  graph  with  one  node 
corresponding  to  one  database  site. 

(2)  Associate  each  edge  with  the  cost  of  the  first  step 
of  the  transmission,  i.e.  s.*c..for  edge  (i,j) 

A  fcj 

(3)  Find  a  minimum-directed  spanning  tree  toward  node 

0. 

Since  there  exists  a  polynomial  time  algorithm  for  the 
minimum  directed  spanning  tree  problem,  this  solution  can  be 
solved  easily.  Since  this  algorithm  considers  only  one 
objective  and  ignores  the  other  objective,  reducing  the 


resulting  data  after  this  operation,  the  worst  case  of  this 
algorithm  could  be  very  bad..  So  we  expect  that  this 
algorithm  must  be  used  with  an  improvement  method  to 
provide  an  algorithm  for  [SQP]. 

Another  algorithm  is  obtained  by  considering  the 
objective  of  reducing  data  by  intermediate  processing  only, 
and  ignordinge  the  first-step  transmission  cost.  As  with 
algorithm  B,  we  expect  that  this  algorithm  must  be  used 
with  an  improvement  method  to  provide  an  heuristic 
algorithm  for  [SQP].  We  describe  the  algorithm  as 
follows: 

Algorithm  C: 

(1)  Build  a  complete  graph  with  one  node  corresponding 
to  one  database  site 

(2)  Associate  each  edge  (i,j)  with  the  amount  of 
data  being  reduced,  s.*Sj(l-d6j) 

(3)  Find  a  maximum  matching  of  the  graph 

(4)  Combine  the  two  nodes  in  a  matching  as  one  and  build 
another  complete  graph  in  which  each  edge  is 
associated  with  the  new  amount  of  data  between  this 
pair  of  nodes,  and  find  a  maximum  matching  again. 

(5)  Repeat  this  procedure  until  the  number  of  nodes 
in  each  branch  of  the  tree  is  less  than  a  fixed 
number. 


Another  algorithm  is  to  take  a  weighted  combination  of 
these  two  factors.  If  we  let  w.»  djj  /  (n-1}  which  is  the 
average  unit  transmission  cost  from  node  i  to  the  other 
node,  then  we  can  associate  each  edge  (i,  j)  with  a 
combination  of  first-step  transmission  cost  and  weighted 
second-step  transmission  cost,  i.e.,  weighted  data  reducing 
factor.  This  approach  is  expected  to  generate  a  better 
solution  than  algorithm  B  and  algorithm  C. 


Algorithm  0: 


(1)  Build  a  completed  directed  graph  with  one  node 
corresponding  to  one  database  site. 

(2)  Associate  each  edge  with  the  combination  of 

the  first-step  transmission  cost  and  the 
weighted  data  reducing  factor  of  intermediate 
processing,  i.e. 

y  ccj  *  Vd« ysy¥i  for  edse 

(3)  Find  a  minimum  directed  spanning  tree  toward  node 

0. 


The  algorithm  for  solution  improvement  is  called 
the  interchange  heuristic.  This  method  starts  from  one 
approximate  solution  and  then  perturbs  it  somewhat  to  see  if 
a  better  solution  results.  If  a  better  solution  does 
result,  the  original  solution  is  discarded  and 
perturbations  on  the  new  solution  are  tried.  Methods  of  this 
kind  for  the  traveling  salesman  problem  are  described  in 
[LIN  65]  and  [RSL  77].  We  generalize  these  techniques  as  a 
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perturbation  method  in  our  heuristic  algorithms. 

Define  a  k-change  of  a  problem  as  the  deletion  of  k 
edges  and  their  replacement  by  other  k  edges  so  that  another 
solution  is  obtained.  We  also  define  a  solution  as  k-optimal 
if  no  k-change  produces  a  better  solution. 

We  describe  the  algorithms  for  solution  improvement  .  by 
k-interchange  heuristics  in  the  following. 

Algorithm  k-interchange: 

(1)  Obtain  a  initial  feasible  solution. 

(2)  Repeat  apply  k-change  to  the  current  solution  until 
the  k-optimal  has  reached. 

The  heuristic  algorithm  for  solving  SQP  can  be  the 
combination  of  one  of  the  algorithms  A  through  D  with 
algorithm  k-interchange  for  some  fixed  k. 

4.3  Heuristic  Algorithm  for  Distributed  Query  Processing 

Problem 

For  the  general  query  processing  problem,  we  want  to 
solving  a  query  by  a  sequence  of  join  and  semi  join 
operations.  Our  heuristic  algorithm  is  to  generate  a 
sequence  of  join  operations  for  solving  the  query  with 
minimun  cost  by  a  similar  algorithm  of  algorithm  A.  This  is 
an  incremental  method  for  building  up  the  solution.  Then  we 
check  each  edge  to  determine  whether  there  exists  a  semi  join 
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operation  which  will  make  the  join  operation  have  less 
transmission  cost.  If  such  a  semijoin  operation  exists,  then 
we  replace  the  current  solution  strategy  by  adding  this 
semi join  edge  to  it.  We  describe  the  algorithm  as  follows: 

Algorithm  B: 

(1)  T  •f  initially 

(2)  Vnode  t€V  -  {v  |  cyo  -min  {cyj  }  6  Sjdyj.  £  1  V  j  1. 
includ  (t,0)  into  the  tree.  i.e.  T  ■  {  (t,0)|yt€V 
1 

(3)  Sort  the  remaining  nodes  in  the  increasing  order  of 

the  size  of  the  data  at  each  node.  Assume  that  the 
order  is  s^,  >  ...>  s  .  ,  where  s  r  .*  width 

of  the  relation. 

(4)  Add  the  nodes  in  the  tree  T  according  to  the 
order  of  this  sequence 

(5)  In  adding  node  v  into  the  tree  T,  choose  a  node  t 

W) 

already  in  the  tree  so  that  adding  the  edge  (v^f  ,t 
)  gives  a  minimum  cost  increase. 

(6)  update  node  labels  according  to  formula  (4.2) 

(7)  After  all  nodes  are  included  in  the  tree,  we  check 
each  node  to  determine  whether  there  exists  one 
semijoin  edge  which  will  reduce  the  original 
transmission  cost.  If  there  exist  one,  then  include 
the  semi join  edge.  The  order  of  the  checking 
sequence  is  from  the  terminal  node  up  to  its 
predecessor. 

(8)  Stop  until  all  nodes  have  been  checked  for  semijoin 
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operations. 

We  expect  to  have  a  numerical  example  for  this 
algorithm.  We  also  hope  to  generate  other  efficient 
algorithms. 

4.4  Conclusions 

In  this  chapter,  we  analyzed  the  difficult  nature  of 
the  query  processing  problem  and  provided  an  analytical 
basis  for  heuristic  algorithms.  We  first  considered  the 
simpler  case  of  the  problem  where  all  possible  semijoins  are 
performed  first,  i.e.,  all  semi  join  reducibilities  become 
zero.  We  provide  several  heuristic  algorithms  for  this 
problem.  Each  of  the  algorithms  has  two  stages.  The  first 
stage  is  to  find  a  feasible  processing  strategy.  The  second 
stage  is  the  improvement  stage  where  interchange  procedures 
are  used.  We  then  extend  those  heuristic  algorithms  to  the 
general  query  processing  problem  by  including  semijoin 
operations  into  the  sequence  of  join  operations.  We  first 
create  a  solution  strategy  using  only  join  operations  and 
then  change  a  join  operation  to  one  semijoin  operation  and 
one  join  operation  when  it  is  beneficial  to  do  so. 

Future  research  direction  is  the  study  of  the  analytic 
behavior  of  those  heuristic  algorithms.  Although  some 
analytic  results  of  worst  case  and  average  case  analysis  are 
difficult  to  obtain,  some  computational  experiments  may  be 
conducted  to  get  a  feeling  for  the  average  performance  of 
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the  algorithms.  Developing  more  efficient  and  general 
solution  procedures  for  general  query  processing  problems  is 
also  a  future  research  area. 
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Chapter  5 

Query  Processing  in  Distributed  Heterogeneous  Databases 

5.1  Introduction 

Database  management  systems  are  among  the  most 
important  and  successful  software  developments  in  this 
decade.  They  have  already  had  a  significant  inpact  in  the 
field  of  data  processing  and  information  retrieval.  The 
existing  commercial  systems  are  almost  exclusively  based  on 
one  of  the  three  data  models:  relational,  netvorM  e.g., 
CODAS YL )  and  hierarchical.  Many  organizations  have 
independently  developed  their  own  databases  on  their  own 
computers  and  database  management  systems  to  support  the 
planning  and  decision  making  in  operations.  Each  DBMS  has 
its  own  intended  schema,  access  control,  degree  of 
efficiency,  security  classification  and  operational 

requirements,  etc.  Often,  different  databases  may  contain 
data  relevant  to  the  problem  although  their  structure  and 
representation  could  be  different.  It  will  be  beneficial  if 
we  can  bring  together  all  these  databases  in  several 
locations  in  order  to  integrate  information  resources  and 
build  new  kinds  of  applications  to  help  operations. 

Heterogeneous  database  management  systems  which  are 
geographically  distributed  around  the  world  play  one  of  the 
most  important  roles  in  command,  control  and  communication 
(C ^  )  systems,  and  other  organizational  operations.  One  of 
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the  main  problems  in  using  these  databases  is  the 
communication  between  them  when  we  need  to  retrieve  and 
update  information.  When  a  user  issues  a  query  at  one  site, 
the  system  must  be  able  to  respond  to  the  user  with  an 
answer  of  the  query  as  soon  as  possible.  In  today's  business 
operations,  efficient  decision  making  based  on  information 
resources  will  depend  increasingly  on  a  more  automated  and 
faster  response  distributed  database  management  system. 

Existing  data  communication  technology  for  computer 
networks  does  not  yet  provide  a  solution  for  the 
communication  between  these  DBMSs.  Communication  delay  of 
data  transmission  is  still  a  dominant  factor  in  system 
performance.  In  order  to  develop  a  method  for  query 
processing  in  distributed  heterogeneous  database  management 
sytems  environments  and  to  develop  a  quantitative  and 
syntactic  understanding  of  the  query  processing  strategies 
optimization,  we  need  to  have  an  integrated  system  to 
combine  and  share  information  in  a  heterogeneous  distributed 
database  environment. 

In  previous  studies,  a  number  of  approaches  have  been 
proposed  to  the  problem  of  heterogeneous  DBMSs.  One  early 
proposal  was  to  restructure  each  database  into  a  common 
structure  under  a  given  DBMS;  that  is,  to  convert  and  to 
migrate  the  entire  database  from  the  various  DBMS's  to  the 
given  DBMS.  This  type  of  approach  is  generally  called  "data 
base  conversion."  [SHL  75],  [SHTGL  77 J,  [SLH  76],  [SO  75], 
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and  [SU  76]  are  works  in  this  direction.  Another  approach  is 
to  maintain  the  original  database  and  provide  an  effective 
information  exchange  among  the  different  systems  without 
incurring  mass  data  migration.  The  major  advantages  and 
functional  characteristics  of  this  latter  approach  're: 

1.  A  global  data  model  is  used  to  provide  users  with  a 
common  schema. 

2.  The  data  bases  can  be  kept  distributed  without 
requiring  any  data  movement. 

3.  Each  data  base  can  both  operate  in  its  own  local 
mode  and  participate  in  the  distributed  system. 

4.  The  application  programs  for  the  original  data  bases 
do  not  have  to  be  changed  and  remain  still  usable. 

The  following  researches  are  in  this  direction:  In  [AD 
77]  and  [AD  78],  the  model  of  Abrial,  [AB  74],  is  used  as  a 
global  data  model.  In  [NBC  76]  and  [CP  80],  the  Entity 
relationship  (E-R)  model  proposed  by  [CHE  76]  is  used  as 
global  conceptual  data  model  and  a  modified  version  of  DIAM 
(Data  independent  accessing  model)  incorporating  the  syntax 
proposed  by  the  C0DAS7L  SDDTG  (Storage  Data  Definition  and 
Translation  Group)  is  used  as  the  global  internal  model  [SDD 
77].  In  [SBD  81],  MULTIBASE  uses  the  Functional  Data  Model 
of  [SHI  79]  as  the  global  conceptual  data  model.  DAPLEX, 
embedded  in  the  programming  language  ADA,  is  used  as  the 
user-interface  language  and  a  subset  of  ADAPLEX  is  used  as  a 
mapping  language.  The  Database  Communication  System  we 
propose  in  this  chapter  can  be  classified  into  the  second 


category.  The  main  differences  of  our  approach  from  others 
are  that  we  take  advantage  of  the  relational  data  model  and 
use  a  very  high-level  non-procedural  language  for  user 
interfaces. 

This  chapter  is  dedicated  to  the  study  of  communication 
among  nonintegrated,  heterogeneous  and  distributed  DBMSs.  A 
concept  of  a  data  communication  system  is  proposed  to 
provide  a  way  to  integrate  and  share  information  in  a 
heterogeneous  database.  The  Database  Communication  System  is 
a  front-end  software  system  of  a  DBMS.  It  presents  to  users 
an  environment  of  a  single  system  and  allows  them  to  access 
the  data  using  a  high  level  data  manipulation  language 
without  requiring  that  the  total  database  be  physically 
integrated  and  controlled.  The  architecture  of  the  database 
communication  system  proposed  in  this  chapter  is  only  a 
conceptual  design.  All  three  components ,  schema  unit,  query 
unit  and  control  unit  require  detailed  requirement 
specifications.  It  is  not  our  intent  in  this  thesis  to 
study  all  of  them.  We  emphasize  only  the  query  unit  in  this 
thesis.  In  the  next  chapter,  we  will  study  how  a  query  is 
processed  in  a  heterogeneous  database  environment.  Schema 
translation  and  query  translation  will  also  be  addressed 
there. 


The  organization  of  this  chapter  is  as  follows:  In 
section  5.2,  we  describe  the  motivations  and  difficulties  of 
heterogeneous  DBMS  and  specify  the  goal  of  this  system 
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design.  In  section  5.3,  we  describe  the  components  of 
heterogeneous  database  communication  systems.  The 
relational  data  model  is  then  chosen  as  a  global  data  model 
to  support  the  communication.  Several  reasons  are  described 
in  section  5.3.1.  In  section  5.3.2,  we  describe  the 
architecture  of  a  database  communication  system  and  the 
functional  characteristics  of  each  of  its  components.  In 
section  5.3.3,  some  network  configurations  are  described 
that  permit  integration  of  heterogeneous  DBMSs  by  using 
database  communication  systems.  In  section  5.4,  we 
described  briefly  how  a  query  is  processed  in  a 
heterogeneous  database  environment  and  we  will  leave  the 
details  of  the  query  processing  to  the  next  chapter. 
Lastly,  several  other  problems  of  database  communication 
systems  requiring  further  detailed  specification  are 
discussed. 

5.2  Motivation  and  Objectives 

5.2.1  The  Heterogeneous  World  of  DBMSs 

In  the  real  world,  resources  are  heterogeous  in  nature, 
e.g.  size,  shape,  color,  structure  etc.  The  same  fact  exists 
in  the  world  of  DMBSs.  There  are  at  least  several  dozens  of 
heterogeneous  DBMSs  commercially  available  today,  e.g., 
IMS,S2000,  TOTAL  IDMS,  etc.  From  several  points  of  view,  we 
can  distinguish  heterogeous  DMBSs. 
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1.  Conceptual  Model  Viewpoint 

Traditionally,  the  data  model  may  be  classified  into 
three  categories:  hierarchical,  network,  and  relational. 
Most  of  the  commercially  available  systems  may  be 
implemented  in  some  variant  of  one  of  the  three  models.  For 
example  ,  IMS  is  hierarchical,  system  2000  is  inverted 
hierarchical,  TOTAL  follows  CODASYL  DBTG  architecture, 
ADABAS  is  inverted  network  and  INGRES  is  relational. 

2.  Physical  Model  Viewpoint 

Although  two  DBMSs  may  have  the  same  conceptual  model 
or  may  even  be  the  same  type  of  DBMS,  they  may  have 
different  data  structures. 

For  example,  the  storing  information  about  courses 
offered  and  courses  taken  by  students  may  well  use  different 
physical  data  structures  to  represent  it  in  a  network  model. 
With  diferent  data  structures,  the  access  paths  will  be 
different. 
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3.  Data  Manipulation  Language  Viewpoint 

The  data  manipulation  language  can  be  record-at-a-time 
or  set-at-time.  In  other  words,  it  can  be  low-level 
procedural  or  high-level  non-procedural.  It  depends  on  the 
conceptual  model  and  physical  model  the  system  has  adapted. 
It  also  depends  on  the  system  itself.  For  example,  in  the 
relational  system.  System  R,  the  language  can  be  SEQUEL  or 
Query-by-Example . 

4.  Application  Viewpoint 

From  an  application  point  of  view,  the  DBMS  can  be 
classified  into  a  general-purpose  system  and  a  special 
purpose  system.  TOTAL  is  >a  general-purpose  DBMS  which  is 
used  for  all  kinds  of  different  application  purposes.  The 
PARS  (Programmed  Airline  Reservation)  System  is  a  special 
system  which  serves  only  a  specialized  application.  The 
systems  used  for  different  purposes  support  different 
facilities. 

5.  Machine  Viewpoint 

The  same  DBMS  can  be  implemented  on  different 
computers.  The  ARPAMET-Datacomputer  system  is  a  typical 
heterogeneous  system  where  quite  different  types  of 
computers  are  tied  together  to  implement  their  own  DBMSs. 
Different  computers  may  differ  in  their  speed,  memory  size, 
storage  management,  etc. 


138 


6.  System  Control  viewpoint 

Viewed  from  the  system  control  aspect,  there  are  two 
types  of  systems:  centralized  v.s.  decentralized  control 
systems.  A  centralized  control  system  assumes  the  existance 
of  one  central  control  function  to  handle  all  systemwide 
global  control.  The  LADDER-FAM  (Language  Access  to 
Distributed  Data  with  Error  Recovery-File  Access  Manager) 
[1,2]  developed  at  SRI  is  an  example.  A  distributed  control 
system  where  the  control  is  completely  distributed  to  each 
subsystem  is  more  reliable.  The  SDD-1  system  of  the 
Computer  Corporation  of  America  [3]  is  an  example  of  this 
type. 


5.2.2  Difficulties  and  Approaches 

The  large  bulk  of  local  data  are  produced  at  a  variety 
of  locations  in  many  fields.  In  business,  scientific 
research  and  government,  the  data  exchange  is  very  important 
in  decision  making,  experiment,  management  and  control.  The 
difficulties  of  communications  between  heterogeneous  DBMSs 
can  be  identified  as  follows. 

1.  Data  model  -  the  conceptual  models  for  different 

DBMSs  may  be  different.  A  user  having  a  knowledge  of  one 
system  may  not  be  familiar  with  another  system.  Selection  of 
a  data  model  for  every  system  to  provide  a  uniform  view  to 
the  end  user  is  essential. 
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2.  Data  definition  language  -  in  addition  to 

selecting  a  data  model,  a  data  definition  language  to 
support  the  description  of  conceptual  scheme  is  also 
essential.- 

3.  Data  manipulation  language  -  the  user's  query 

language  cannot  be  the  one  for  local  host  schemes;  it  must 
be  a  query  language  which  supports  the  global  uniform 
scheme.  Because  the  end  users  don't  know  what  data  model  the 
query  will  have  to  deal  with,  they  are  obviously  unable  to 
specify  how  something  must  be  done,  and  so  must  instead 
specify  what  is  to  be  done,  i.e.,  the  language  must  be 
nonprocedural. 

4.  Data  integration  -  most  of  the  data  base  set  up  by 

independent  organization  are  hard  to  integrate.  It  is  also 
possible  that  inconsistencies  exist  between  copies  of  the 
same  information  stored  in  different  databases.  Combining 
all  local  schema  together  to  form  a  global  schema  is 
necessary  in  order  to  provide  an  integration  schema  for 
them. 


5.  Data  incompatibilities  -  the  same  objects  in 

different  DBMSs  may  be  represented  in  different  types, 
different  schema  names,  different  scales,  etc.  When 
integrating  the  DBMSs,  we  need  to  recognize  these 
incompatibilities  of  data  sources  and  identify  them  in  the 
integration  schema. 
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6.  Processing  results  —  once  a  result  is  gotten  for  a 
query,  it  is  expressed  in  the  form  of  the  original  data 
model,  and  it  must  be  translated  to  the  uniform  data  model. 
Can  this  current  result  be  saved  and  be  operated  on  later? 

7.  Data  dictionary  and  directory  schema  -  We  must 

provide  each  end  user  with  a  uniform  directory  so  that  he  is 
able  to  see  easily  what  data  is  available,  where  it  is,  and 
how  to  get  it. 

8.  Access  planning  - —  with  a  high-level  query 
language,  the  system  should  provide  an  optimizing  strategy 
for  each  query  in  a  distributed  system. 

9.  Multiple-systems  access  -  each  query  may  reference 

data  in  two  or  more  different  systems.  The  system  must 
coordinate  their  transactions. 

10.  Multiple  view  support  -  If  the  system  wants  to 

support  multiple  schemes  for  each  DBMS,  so  that  users  can 
have  freedom  to  choose  their  own  preferred  query  language 
and  global  schema,  then  the  systems  must  add  more  schema 
translators  and  query  translators. 

11.  Control  system  -  After  integrating  different 

DBMSs,  the  system  has  to  have  a  system  controller  to  control 
the  network  DBMSs.  The  data  manager  must  decide  whether  to 
use  centralized  control  or  distributed  controls. 


5.2.3  Design  Objectives 


Before  we  set  up  the  design  approach,  it  is  important 
to  decide  what  goals  we  want  to  achieve: 

1.  Central  view  for  users  —  All  user's  views  are 
defined  upon  a  global  conceptual  schema  which  is  the  union 
of  the  local  schemata  and  integration  schema.  It  is  hoped 
that  from  the  user's  point  of  view,  the  system  behaves  in 
the  same  way  as  in  a  centralized  system  and  the  user  is 
unaware  of  the  fact  that  he  may  be  dealing  with 
heterogenous  local  databases. 

2.  General  to  any  system  -  We  wish  the  database 

communication  system  to  be  general  to  any  system  and  that  it 
can  be  used  to  integrate  various  database  systems  for 
various  applications.  In  addition,  we  want  to  minimize  the 
cost  and  effort  and  maximize  the  performances. 

3.  Flexible  to  future  extension  -  We  know  that  the 

volume  and  the  complexity  of  database  are  extending  very 
rapidly.  It  is  the  major  factor  of  the  cost  of  maintenance. 
We  want  the  system  to  be  flexible  for  the  future  extension 
with  minimum  cost. 

4.  Reliability  —  The  communications  between 
heterogeneous  DBMSs  should  not  fully  rely  on  a  centralized 
system.  The  communication  capability  should  be  distributed 
among  every  heterogeneous  DBMS. 
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5.  Distributed  control  —  Based  on  the  reliability  and 
parallel  processing  issues,  we  want  the  communication 
between  DBMSs  to  have  distributed  control. 

6.  Security  —  When  combining  heterogeneous  DBMSs, 
some  confidential  data  in  one  system  should  often  not  be 
accessible  so  that  access  can  be  checked  and  the  data 
protected. 

5.3  Heterogeneous  Database  Communication  Systems 
5.3.1  Data  Model 

Because  we  are  dealing  with  communications  between 
different  DBMSs  supported  by  different  data  models, e.g. 
hierarchical,  relational,  etc.,  our  approach  is  to  select  a 
data  model  to  support  a  uniform  conceptual  schema  for  each 
DBMS.  It  provides  users  with  a  homogeneous  view  of 
conceptual  schema  and  also  serves  as  a  bridge  between  the 
underlying  models.  Many  logical  data  models  have  been 
proposed  which  model  the  real  world  in  terms  of  the 
interested  objects  and  the  interrelation  between  them.  In 
[RER  76],  the  authors  study  23  data  models  and  attempt  to 
establish  the  similarities  and  differences  among  them 
according  to  data  model  structure,  logical  access 
type, semantics  and  terminology.  Recent  research  has  focused 
on  two  directions.  One  is  to  enhance  the  refinement  of  the 
conventional  data  models.  The  notion  of  "normal  form  theory” 
has  led  to  a  refinement  of  the  relational  model  which 
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attempts  to  catch  up  more  semantics  information  by 
explicitly  expressing  functional  dependencies  among  data. 
Many  authors  worked  along  this  direction  and  built  some  type 
of  semanatic  data  models.  The  second  approach  has  been  to 
emphasize  the  identification  of  a  basic  simple  construct. 
This  construct  is  simple  and  with  clean  semantics.  It  may 
be  easily  collected  in  a  meaningful  fashion  to  represent 
complex  varieties  in  semantic  structures.  It  is  clear  that 
there  is  no  mental  model  which  is  so  superior  that  it  is 
good  for  all  users. 

In  view  of  the  state  of  the  art,  we  choose  a  relational 
data  model  as  a  global  data  model  to  provide  a  central  view 
to  the  users'  bases  for  the.  following  reasons: 

1.  The  relational  data  model  shields  the  user  from  data 
formats,  access  methods  and  the  complexity  of  storage 
structures. 

2.  It  supports  a  high-level  non-procedural  query 
language. 

3.  The  storage  and  data  structures  are  very  simple; 
all  data  is  represented  in  the  form  of  records. 

4.  Access  paths  do  not  have  to  be  predefined.  A  number 
of  power  operators  are  supported  in  relational  model,  e.g., 
select,  project,  join,  etc.  for  data  retrieval. 

5.  Because  of  the  decline  of  hardware  cost  and  the  rise 
of  manpower  cost,  a  high-level  nonprocedure  manipulation 
language  is  necessary  to  minimize  the  user  workload. 
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6.  The  relational  model  provide  a  simple  and  powerful 
interface  to  the  data. 

7.  The  relational  model  has  fast  response  to  ad  hoc 
queries  which  are  considered  to  be  the  high-percentage  of 
queries. 

8.  The  advance  in  associative  storage  devices  offers 
the  potential  of  greatly  improving  the  efficiency  and 
therefore  the  performance  of  a  relational  system. 

Based  on  this  choice,  we  propose  a  database 
communication  system  which  incorporates  distributed 
heterogeneous  systems  into  a  unified  entity  and  shares  the 
information  resources  in  a  distributed  manner. 

5.3.2  Architecture  of  Database  Communication  Systems 

Although  the  heterogeneous  database  management  systems 
are  usually  geographically  distributed,  the  existing 
approach  for  communication  between  heterogeneous  DBMSs 
builds  a  single  control  system  which  coorperates  and 
communicates  between  different  DBMSs  by  using  the  computer 
network.  One  asks  why  shouldn't  the  database  control  also 
spread  through  each  coorperating  DBMS?  Hopefully  doing  so 
will  provide  a  better  use  of  data  resources  and  improve  the 
performance  and  reliability. 

Our  approach  is  to  define  a  database  communication 
system  which  serves  as  a  front-end  processor  of  local  DBMSs 
as  an  interface  to  the  computer  network.  Zt  is  a  software 


system  aimed  to  link  geographically  distributed 
heterogeneous  DBMSs  together  and  to  act  as  a  bridge  for 
communication  between  local  DBMSs  (see  Figure  5.2). 

The  basic  underlying  assumptions  are: 

1.  It  is  possible  to  exchange  information  among  the  various 
systems  and  they  are  willing  to  maintain  information. 

2.  Each  DBMS  is  considered  to  be  able  to  execute  a  given 
local  transaction. 

3.  There  exists  a  communication  network  which  connects  the 
various  DBMSs. 

4.  The  access  to  a  local  DBMS  is  not  affected  by  the 
operation  of  the  data  communication  system  which  should 
be  transparent  to  the  local  user. 

Functional  Characteristics 

The  database  communication  system  consists  of  three 
major  units:  (see  Figure  5.3) 

*  Schema  Unit 

*  Query  Unit 

*  Control  Unit 

The  functional  characteristics  of  each  component  within  a 
unit  are  described  separately  in  order  to  maintain  the 
modularity. 

A.  Schema  Unit: 

The  schema  unit  maintains  the  local  schema  and 
integrity  schema,  it  consists  of  three  components.  We  will 
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describe  each  component  following  by  its  important 
functions,  (see  Figure  5.4) 

(A.l)  Schema  Translator 

*  Reads  a  schema  description  of  the  local  DBMS 
and  translates  it  into  a  schema  description 
in  a  global  data  model  and  vice  versa. 

*  This  is  done  by  a  mapping  of  the  data 
definition  language  and  the  structure  of  the 
data  model. 

*  The  schema  translator  may  be  different  for 
different  target  DBMS. 

*  A  schema  unit  can  have  several  different 
kinds  of  schema  translators. 

(A. 2)  Local  Schema  and  Global  Schema 

*  Local  schema  is  the  schema  translated  by  the 
schema  translator  from  the  local  host  schema. 

*  Global  schema  is  the  union  of  all  local 
schema  and  integration  schema  of  the  database 
communication  system. 

(A. 3)  Integration  Schema 

*  It  consists  of  information  about  integrity 
constraints,  data  incompatibility,  and  data 
redundancy. 

*  It  is  set  up  at  the  time  a  DBMS  joins  to  the 
heterogeneous  network. 

*  The  component  can  be  viewed  as  a  small 
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database . 


B.  Query  Unit: 

A  query  unit  takes  care  of  the  query  processing, 
optimization  and  access  strategy.  It  consists  of  three 
components,  (see  Figure  5.5) 

(B.l)  Query  Translator 

*  Translates  a  query  in  the  global  query 
language  into  a  query  accepted  by  the  local 
DBMS. 

*  This  is  done  by  a  mapping  of  the  data 
manipulation  language. 

*  The  query  is  parsed  and  simplified. 

(B.2)  Query  Optimizer 

*  The  query  is  decomposed  into  local  subqueries 
which  reference  only  local  schema  and  queries 
which  reference  only  the  integration  schema. 

*  The  distributed  query  algorithm  must  provide 
an  execution  strategy  which  minimizes  both 
the  amount  of  data  moved  from  site  to  site 
and  the  number  of  messages  sent  between 
sites.  In  addition,  the  algorithm  should  take 
advantage  of  the  computing  power  available  at 
all  of  the  sites  involved  in  the  processing 
of  the  query. 

*  The  algorithm  must  also  take  care  of  the 
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query  recomposer  within  the  optimization 
strategy. 

(B. 3)  Query  Recomposer 

*  The  access  strategies  are  then  executed.  The 
results  of  the  execution  are  represented  in 
local  host  schema.  The  final  answer  must  be 
described  in  terms  of  global  schema. 

*  The  result  of  local  queries  must  be  sent  to 
the  answer  site  so  that  the  results  can  be 
put  together  and  reformated  as  the  answer 
expected  by  the  query. 


C.  Control  Units 

(C.l)  Concurrency  Control 

*  The  concurrency  control  algorithm  must  have  a 
synchronization  protocol  to  preserve 
consistency  in  a  distributed  environment. 

*  It  processes  distributed  interleaved 
transaction  by  guaranteeing  that  all  nodes  in 
the  system  process  the  accepted  update  in  the 
same  reference. 

*  Deadlock  detection  or  prevention  mechanisms 
must  be  provided.  When  system  failures 
occur,  the  other  nodes  must  be  able  to 
continue  to  operate  and  the  crashed  nodes 
must  be  able  to  restore  correct  operation. 
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(C.2)  Integrity  Control 

*  There  are  two  levels  of  consistency.  Strong 
mutual  consistency  has  all  copies  of  data  in 
the  system  updated  at  the  same  time.  Weak 
mutual  consistency  allows  various  copies  of 
the  data  to  converge  to  the  same  update 
status  over  time,  but,  at  any  instant  of 
time,  some  copies  may  be  more  up-to-date  than 
the  others. 

*  In  a  C  operational  system,  we  may  want  to 
adapt  weak  mutual  consistency  so  as  to  use 
less  processing  time. 

(C.3)  Security  Control 

*  All  data,  dictionary,  programs  and  services 
must  be  protected  from  unauthorized  access. 

*  All  authorization  information  is  kept  locally 
and  checked  locally. 

*  A  feedback  encryption  and  decryption  system 
must  be  provided  to  each  node  across  the 
communication  network. 

(C.4)  Data  Dictionary/Directory  Schema 

*  Provides  user  with  a  transparent  view  of  the 
directory. 

*  Keeping  information  about  where  various  data 
stored  to  efficiently  provide  the  system 
query  unit  access  data. 


*  Keeping  one  central  master  directory  in  the 
system  with  each  local  DBMS  keeping  a  local 
subset  of  the  control  directory. 

*  These  are  the  "bread  and  butter"  software  for 
a  successful  database  administration 
function. 

5.3.3  Heterogeneous  DBMSs  Network 

By  using  a  database  communication  system  (DCS),  the 
heterogeneous  DBMSs  can  be  integrated  in  several 
interconnection  configurations  according  to  the  desired 
criteria.  For  example,  several  versions  of  the  same  type  of 
system  could  be  grouped  together  under  a  local  database 
communication  system  so  that  they  can  easily  communicate 
without  needing  translation  if  a  query  just  references  the 
local  DBMSs.  The  systems  which  store  similar  data  can  be 
grouped  together  at  a  first  level  so  that  they  can  be  more 
efficient  in  retrieving  data  and  exchanging  information. 
Those  systems  which  store  confidential  data  can  be  put 
together  so  that  the  management  and  security  control  can  be 
handled  more  effectively. 

The  heterogeneous  DBMSs  network  using  a  database 
communication  system  as  a  bridge  for  interconnection  may 
have  one  of  the  following  configurations: 


Star  Architecture  (Centralized  System) 


2 .  hierachical  architecture 
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3 .  Ring  architecture 


4.  Partition  Architecture 


DBMS 


DBMS 


DBMS 


DBMS 


DBMS 
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DBMS 


DBMS 


DBMS 
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Advances  in  hardware,  software,  and  communication 
technology  will  make  the  distributed  DBMSs  possible.  Several 
communication  networks  are  commercially  available  to 
integrate  the  distributed  DBMSs.  For  instance,  ARPANET  is 
one  example  of  a  point-to-point  partially  interconnected 
public  data  network.  It  is  a  packed-switched  network  which 
interconnects  large-scale  computers  worldwide.  Ethernet  is 
one  example  of  a  branching  broadcast  communication  system. 
It  is  a  local  computer  network  for  carrying  digital  data 
packets  among  locally  distributed  computing  stations. 

It  will  also  be  possible  to  interconnect  existing 
networks  of  multiple  systems  with  new  designs,  using  gateway 
technology.  Combinations  of  loop  or  hierarchical  and 
point-to-point  interconnect  technology  will  make  it  feasible 
to  develop  complex  local/remote  systems.  The  combination  of 
low— cost  satellite  communication  links  and  high-speed 
fiber-optics  loop-based  systems  will  also  provide  a  basis 
for  large,  complex,  hybrid  interconnect  structures  to  share 
information  on  distributed  DBMSs. 


5.4  Query  Processing  in  a  Heterogeneous  Environment 

Based  on  the  architecture  of  the  database  communication 
system  (DCS)  we  proposed  in  [HD81]  ,  we  adopt  the  relational 
model  as  the  global  conceptual  model.  It  is  necessary  to 
provide  a  relational  schema  for  each  database.  For  those 
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databases  in  which  the  underlying  data  models  are  not 
relational  models,  schema  translators  will  be  required  to  do 
the  translation  jobs.  A  specific  schema  translator  is  needed 
for  a  specific  data  model  to  translate  the  underlying  schema 
to  a  relational  logical  schema.  The  integration  schema 
consists  of  information  about  integrity  constraints,  data 
incompatibility  and  data  redundancy.  This  integration 
schema  can  be  viewed  as  a  small  database.  For  ** the  query 
which  is  against  this  local  relational  schema,  it  is  also 
necessary  to  provide  tranjlation  rules  to  translate  the 
relational  operations  into  data  manipulation  language 
statements  of  the  underlying  data  model.  The  users  will  see 
the  system  as  a  distributed  relational  database  system. 

In  our  approach,  each  database  system  is  presented  to  a 
user  with  a  global  relational  schema.  A  query  for  data 
access  or  update  is  specified  in  terms  of  a  relational 
calculus-like  qualification  over  relations  with  a  target 
list.  Codd's  data  sublanguage  ALPHA  (OSL  ALPHA)  [CODD  72]  is 
one  of  the  calculus-based  data  sublanguages.  It  consists 
simply  of  the  relational  calculus  in  a  syntatic  form  which 
more  closely  resembles  that  of  a  programming  language.  In 
practice,  the  syntax  would  have  to  be  compatible  with  that 
of  the  host  language,  whatever  that  was.  For  our  purpose, 
we  shall  use  the  syntax  of  ALPHA  which  is  expressed  in  [CODD 
70]. 
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Given  a  relational  Alpha  query  presented  to  the  system, 
the  optimizer  in  the  query  processing  unit  of  the  system 
will  find  an  optimal  strategy  and  transform  the  query  into 
an  optimal  sequence  of  relational  algebra  operators.  In 
order  to  execute  the  operations  against  the  local  DBMS,  the 
query  translator  will  have  to  translate  a  relation  algebra 
operation  against  the  local  DBMS  into  a  program  of  data 
manipulation  language  (DML)  statements  of  the  target  system. 

Join,  Semijoin,  Union,  Intersection  and  Difference  of 
two  relations  require  two  operands  having  the  same  set  of 
attributes.  They  can  be  executed  by  using  project  and  select 
operations  to  retrieve  data  and  put  it  into  relational  form 
and  then  perform  the  desired  binary  operation.  The  details 
of  query  translation  will  be  discussed  in  the  next  chapter. 

5.5  Conclusions 

We  have  described  the  architecture  of  a  heterogeneous 
database  communication  system  in  this  chapter.  The  database 
communication  system  is  an  approach  to  integrating 
heterogeneous  database  management  systems.  We  believe  that 
the  integration  of  many  independent,  distributed  information 
resources,  should  be  helpful  in  information  retrieval  and 
decision  making  in  solving  real  world  problems.  There  are 
several  problems  which  need  further  study  in  order  to  make 
the  system  successful:  query  optimization,  distributed 
concurrency  control,  translation  rules,  and  security 
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control,  etc. 

In  the  environment  of  business  operations,  time  is  a 
very  important  factor.  The  user  should  be  able  to  easily 
form  a  query  and  have  the  system  retrieve  the  data  and 
recompose  it  for  quick  presentation  to  the  user.  Query 
optimization  is  the  most  important  component  and  is  the 
subject  of  this  thesis.  Concurrency  control  problems  have 
been  widely  studied,  mostly  for  centrally  controlled 
systems.  We  need  to  study  and  develop  algorithms  which  are 
suitable  for  a  distributed  environment.  For  the  translation 
between  global  schema  and  local  schema,  and  between  a  query 
in  a  global  language  and  a  query  in  a  local  language,  we 
need  to  study  the  rules  for  translation  for  different  data 
models  and  manipulation  languages.  This  is  the  subject  of 
the  next  chapter.  Security  control  is  one  of  the  most 
important  problems  in  DBMS  systems.  Because  the  data  are 
often  integrated  together,  the  security  control  of 
classified  information  is  essential.  The  mechanism  for 
checking  access  rights  and  encryption  of  the  information 
flowing  throughout  the  network  deserves  further  study.  This 
solution  of  a  database  communication  system  makes  the 
distributed  system  transparent  to  users  from  an  operational 
point  of  view.  It  is  hoped  that  such  a  database 
communication  system  will  increase  the  efficient  usage  and 
management  of  information  and  data  of  heterogeneous  database 
management  systems. 


In  the  next  chapter,  ve  will  study  query  processing  in 
heterogeneous  database  management  systems.  Schema 
translation  and  query  translation  will  also  be  described. 
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Chapter  6 

Schema  and  Query  Translation 


6.1  Introduction 

Based  on  the  architecture  of  the  database  communication 
system  we  proposed  in  the  last  chapter,  we  adopt  here  the 
relational  model  as  the  global  conceptual  model.  It  is 
necessary  to  provide  a  relational  schema  for  each  database 
management  system  (DBMS).  For  those  databases  in  which  the 
underlying  data  models  are  not  relational  models,  schema 
translators  will  be  necessary  in  order  to  provide  the 
relational  view  to  the  user.  For  each  specific  data  model, 
a  specific  schema  translator  is  needed  for  a  specific  data 
model  to  translate  the  underlying  schema  to  a  relational 
logical  schema.  If  the  underlying  data  is  a  relational 
model,  in  some  cases  we  still  need  to  do  some  adjustment  and 
provide  a  uniform  relational  schema.  For  example,  in  one 
database,  the  students'  information  relation  will  have 
attributes  (course,  faculty,  studentID,  grade),  and  in  the 
other  database,  there  will  be  a  relation  with  attributes 
(course,  professor,  studentID,  grade).  The  attribute 
"faculty"  in  one  relation  in  fact  has  the  same  domain  as  the 
attribute  "professor"  in  the  other  relation.  We  need  to  put 
this  condition  as  an  integrity  constraint  in  the  integrity 
schema  component  and  provide  a  new  uniform  relation  schema 
for  this  system.  The  other  problem  we  also  need  to  consider 
is  the  data  incompatability .  In  the  first  relation,  the 
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grade  could  be  given  by  a  number  scale  between  0  and  100  , 
and  the  second  relation  could  have  the  grade  in  the  literal 
scale  ”A”  to  "F."  The  data  incompatability  must  be  taken 
care  of  by  the  integrity  schema.  Schema  translators  are  one 
of  the  topics  that  will  be  discussed  in  this  chapter. 

Data  manipulation  facilities  will  also  be  discussed  in 
this  chapter.  In  our  heterogeneous  database  communication 
system,  we  use  ALPHA,  a  non-procedural  relational  calculus 
like  language,  to  express  a  query.  Given  a  query,  it  will  be 
analyzed  by  the  query  optimizer  and  be  transformed  to  a 
sequence  of  relational  algebra  operators  against  those  local 
relational  schemata.  Because  the  underlying  data  models  are 
not  necessary  relational  models,  it  is  also  necessary  to 
provide  translation  rules  to  translate  the  relational 
operations  into  corresponding  data  manipulation  language 
statements  of  the  underlying  data  model.  In  this  chapter,  we 
address  the  problem  of  designing  a  schema  translator  and  a 
query  translator  of  a  specific  data  model. 

Relational,  network,  and  hierarchical  models  are  the 
three  major  data  models  that  have  been  used  in  database 
systems.  The  relational  model  has  been  adopted  here  as  the 
global  conceptual  model.  We  need  to  consider  schema 
translators  from  relational  model  to  relational  model,  from 
relational  model  to  network  model,  and  from  relational  model 
to  hierarchical  model.  We  also  need  to  consider  query 
translators  from  relational  algebra  operators  to 
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corresponding  data  manipulation  languages  of  the  three  data 
models.  As  we  can  .see,  the  schema  translator  and  the  query 
translator  for  the  case  in  which  the  underlying  data  model 
is  relational  are  rather  easy  to  develop.  We  leave  this 
case  here  without  any  futher  detailed  specification. 
Also, since  the  present  state  of  the  art  is  such  that  the 
existing  commercial  systems  are  almost  exclusively  based  on 
one  of  the  other  two  models,  we  therefore  will  restrict  our 
effort  here  to  the  design  of  translators  for  systems  based 
on  these  two  models. 

Hierarchical  systems  provide  for  the  same  form  of 
association  between  two  records  as  does  the  network  system. 
A  hierarchy  is  simply  a  network  that  is  a  forest  (a 
collection  of  trees)  in  which  all  links  point  in  the 
direction  from  child  to  parent.  If  we  have  an  hierarchical 
design  we  can  thus  clearly  regard  it  as  a  particular  network 
specification  and  will  have  no  difficulty  implementing  it 
using  a  network-based  software  product.  It  is  not  difficult 
to  see  that  similar  remarks  can  be  made  about  the  schema 
translation  and  query  translation.  Because  of  this  reason, 
we  concentrate  our  effort  in  this  chapter  on  schema 
translation  from  a  network  model  to  a  relational  model  and 
query  tranlation  from  relational  algebra  operators  to 
network  data  manipulation  languages. 

We  focus  on  those  changes  that  must  be  made  because  of 
the  difference  in  the  level  of  procedurality  of  the 


relational  algebra  operators  and  the  data  manipulation 
language  of  target  data  models.  We  do  not  consider  the 
problem  of  modifying  the  program  when  the  schema  is  altered 
due  to  database  redesign  and  evalution.  We  also  do  not 
attempt  to  address  the  issues  involved  in  the  implementation 
of  these  translation  rules,  such  as  the  syntax  of  the 
language  used  in  a  specific  system. 

This  chapter  is  organized  as  follows.  In  the  next 
section,  schema  translation  rules  are  formulated  for 
translating  C00AS7L/DBTG  schema  into  relational  schema. 
Based  on  this  translation  rule,  we  derive  the  query 
translation  rules  for  mapping  relational  algebra  operations 
into  C0DAS7L  data  manipulation  language  statements  in 
section  6.3. 

6.2  Schema  Translation 

A  great  deal  of  attention  has  been  focused  on  the 
network  approach  since  the  publication  in  April  1971  of  the 
C0DAS7L  DBTG  final  report  [CODA  71].  The  initial  DBTG 
specifications  have  undergone  subsequent  development  and 
refinement  as  reported  in  [CODA  73]  t  [CODA  78]  by  CODASYL 
groups.  A  number  of  commercially  available  systems  have  used 
one  or  more  versions  of  the  specifications  as  the 
implementation  base.  While  those  commercial  implementations 
may  show  slight  differences,  their  underlying  concepts  are 
based  on  the  same  CODASYL/DBTG  data  model. 
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The  OBTG  specifications  propose  three  levels  of  data 
organization,  associated  data  definition  language,  and  the 
language  for  processing  this  data: 

1.  The  schema  data  definition  language,  schema  DDL. 

2.  The  subschema  data  definition  language,  subschema  DDl. 

3.  The  data  manipulation  language,  DML. 

4.  The  device/media  control  language,  DMCD. 

The  schema  is  the  logical  description  of  the  data  base. 
A  schema  description  in  the  DBTG  DDL  includes  four  types  of 
declarations. 

1.  The  schema  name  description--  it  is  unique  for  each 
schema  handled  by  the  DBMS. 

2.  Record  type  declarations—  they  define  the  data  items 
for  each  record. 

3.  Set  declarations —  they  define  the  relationships 
between  defined  record  types. 

4.  Area  declarations —  they  define  the  physical  areas  in 
which  records  will  be  stored. 

There  are  two  kinds  of  record  types  in  the  CODASYL/DBTG 
model:  a  description  record  type  and  a  connection  record 
type. 


168 


A  description  record  type  in  the  DBTG  data  model  has  a 
record  ID,  and  one  or  several  attributes  describing 
properties  of  the  record.  It  is  very  similar  to  a  relation 
in  the  relation  data  model  with  the  record  ZD  as  the  key 
attribute.  Therefore,  a  description  record  type  can  be 
translated  to  a  relational  schema  directly. 

A  connection  record  type  is  introduced  when  n  types  of 
entity  (represented  by  n  description  record  types  )  are  to 
be  connected.  N  set  types  are  also  introduced.  Each  of  the 
n  "entity"  record  types  is  made  the  owner  of  one  of  the  set 
types,  and  the  connection  record  type  is  made  the  member  of 
the  set  types;  and  each  connection  record  occurrence  is  made 
a  member  of  exactly  one  occurrence  of  each  of  the  n  types  of 
set  and  thus  represents  the  connection  between  the 
corresponding  n  entities.  For  this  record  type  we  define  a 
relation  schema  R  with  attributes  consisting  of  the  keys  of 
the  owners  of  the  n  sets  in  which  this  record  is  a  member 
and  the  data  items  of  this  record.  The  key  of  this  relation 
is  the  set  of  keys  of  the  owners  of  the  n  sets. 

A  set  type  is  defined  in  the  schema  as  having  a  certain 
type  of  record  as  its  owner  and  some  other  type  of  record  as 
its  member.  Bach  occurrence  of  a  set  type  consists  of 
precisely  one  occurrence  of  its  owner  together  with  zero  or 
more  occurrences  of  its  member.  For  each  set  type,  we  do 
not  correspondingly  define  a  relational  schema.  Instead,  we 


create  a  table  in  each  node  to  record  all  the  sets  which  can 
be  thought  of  as  defining  an  access  path  relation.  This 
table  contains  three  attributes  {  set,  owner,  member  It 
is  used  in  the  query  translation  process  from  relational 
operator  to  CODAS YL  DML  statements  to  identify  the  access 
path.  The  table  can  be  thought  of  as  a  new  relation  which 
is  stored  in  the  local  database  and  only  used  for  query 
translation.  This  table  provides  information  of  record 
access  paths.  It  isn't  joined  to  the  global  schema  to  be 
presented  to  the  user.  A  singular  set  is  a  set  with  a 
system  as  an  owner  and  a  description  record  as  a  member.  It 
can  be  thought  of  as  a  set  having  exactly  one  occurrence  and 
having  no  owner  record.  By  using  the  singular  set  construct, 
the  set  of  description  record  occurrences  is  exactly  like  a 
sequential  file.  It  provides  the  user  an  access  path  to 
access  the  description  record.  For  the  singular  set,  we  also 
keep  the  set  information  on  the  table. 

An  area  is  a  storage  space  of  a  DBTG  database.  For  each 
type  of  record  the  schema  specifies  the  area  into  which 
occurrences  of  the  record  are  to  be  placed  when  they  are 
entered  into  the  database.  This  concept  can  be  thought  of 
as  the  vertical  and  horizontal  partition  of  the  database. 
For  instance,  consider  a  university  application  that  creates 
student  records.  The  database  administrater  may  decide  for  a 
varity  of  reasons  that  instead  of  representing  status  as  a 
data-item  in  the  student  record  type,  the  classification  is 


to  be  made  by  storing  student  records  in  two  distinct  a.eas, 
graduate  and  undergraduate.  This  area  type  is 
correspondently  mapped  to  the  horizontal  or  vertical 
partition  of  relational  database  which  may  be  necessary  to 
create  a  new  relational  schema. 

In  this  thesis,  we  shall  assume  that  all  occurrences  of  a 
given  type  of  record  are  to  go  into  a  single  area. 

In  conclusion,  we  summarize  the  translation  rules  in  Figure 
6.1,  and  give  an  example  of  the  translation  rules. 


1.  For  a  description  record  type  R,  we  define  a 
relation  schema  R' ,  with  each  data  item  to 
be  a  attribute  of  R.  The  identifier  of  R  is 
the  key  of  R' . 

2.  For  a  connection  record  type,  we  define  a 
relation  schema  S'  with  attributes 
consisting  of  the  identifiers  of  the  owners 
of  the  sets  in  which  S  is  a  member  and  the 
data  items  of  S.  The  set  of  identifiers  of 
the  owners  of  the  set  is  the  key  of  S'. 

3.  For  each  set-type,  we  maintain  a  table  which 
contain  three  attributes:  set,  owner  and 
member . 


Figure  6.1 


EXAMPLE  (DATE  77] 

Assume  the  supplier-part-project  database  S  is  stored 
in  a  C0DAS7L  version  DBMS.  The  schema  of  this  database 
is  defined  as  follows. 


172 

SCHEMA  NAME  IS  S-P-J-SCHEMA 
AREA  NAME  IS  S-AREA 
AREA  NAME  IS  P-AREA 
AREA  NAME  IS  J-AREA 
AREA  NAME  IS  SPJ-ARBA 

RECORD  NAME  IS  S; 

LOCATION  MODE  IS  CALC  HASH- SNO  USING  SNO  IN  S? 
WITHIN  S-AREA; 

IDENTIFIER  IS  SNO  IN  S. 

02  SNO  ;  TYPE  IS  CHARACTER  4. 

02  SNAME  ;  TYPE  IS  CHARACTER  20. 

02  STATUS  ;  TYPE  IS  CHARACTER  3. 

02  CITY  ;  TYPE  IS  CHARACTER  15. 

RECORD  NAME  IS  P; 

LOCATION  MODE  IS  CALC  HASH-PNO  USING  PNO  IN  P; 
WITHIN  P-AREA; 

IDENTIFIER  IS  PNO  IN  P. 

02  PNO  ;  TYPE  IS  CHARACTER  4. 

02  PNAME  ;  TYPE  IS  CHARACTER  20. 

02  COLOR  ;  TYPE  IS  CHARACTER  6. 

02  WEIGHT  ;  TYPE  IS  CHARACTER  4. 

RECORD  NAME  IS  J; 

LOCATION  MODE  IS  CALC  HASH-JNO  USING  JNO  IN  J; 
WITHIN  J-ARSA; 

IDENTIFIER  IS  JNO  IN  J. 


02  JNO  ;  TYPE  IS  CHARACTER  4. 

02  JNAME  ;  TYPE  IS  CHARACTER  20. 

02  CITY  ;  TYPE  IS  CHARACTER  15. 

RECORD  NAME  IS  SPJ; 

LOCATION  MODE  IS  SYSTEM- DEFAULT; 

WITHIN  SPJ- AREA; 

IDENTIFIER  IS  SNO  IS  SPJ, 

PNO  IN  SPJ, 

JNO  IN  SPJ. 

02  SNO  ;  TYPE  IS  CHARACTER  5. 

02  PNO  ;  TYPE  IS  CHARACTER  6. 

02  JNO  ;  TYPE  IS  CHARACTER  4. 

02  QTY  ;  TYPE  IS  FIXED  DECIMAL  5. 

SET  NAME  IS  S-SPJ; 

OWNER  IS  S; 

ORDER  IS  PERMANENT  SORTED  BY  DEFINED  KEYS. 
MEMBER  IS  SPJ 

INSERTION  IS  AUTOMATIC 
RETENTION  IS  MANDATORY; 

KEY  IS  ASCENDING  PNO  IN  SPJ,  JNO  IN  SPJ 
DUPLICATES  ARE  NOT  ALLOWED 
NULL  IS  NOT  ALLOWED; 

SET  SELECTION  IS  THRU  S-SPJ  OWNER 

IDENTIFIED  BY  IDENTIFER  SNO  IS  S 

EQUAL  TO  SNO  IN  SPJ. 


SET  NAME  IS  S-SBT; 

OWNER  IS  SYSTEM; 

ORDER  IS  PERMANENT  SORTED  BY  DEFINED  KEYS. 
MEMBER  IS  S 

INSERTION  IS  AUTOMATIC 
RETENTION  IS  FIXED; 

KEY  IS  ASCENDING  SNO  IN  S 

DUPLICATES  ARE  NOT  ALLOWED 
NULL  IS  NOT  ALLOWED; 

SET  SELECTION  IS  THRU  S-SET  SYSTEM. 

SET  NAME  IS  P-SPJ; 

OWNER  IS  P; 

ORDER  IS  PERMANENT  SORTED  BY  DEFINED  KEYS. 
MEMBER  IS  SPJ 

INSERTION  IS  AUTOMATIC 
RETENTION  IS  MANDATORY; 

KEY  IS  ASCENDING  JNO  IN  SPJ,  SNO  IN  SPJ 
DUPLICATES  ARE  NOT  ALLOWED 
NULL  IS  NOT  ALLOWED; 

SET  SELECTION  IS  THRU  P-SPJ  OWNER 

IDENTIFIED  BY  IDENTIFER  PNO  IN  P 

EQUAL  TO  PNO  IN  SPJ. 


SET  NAME  IS  P-SET; 
OWNER  IS  SYSTEM; 


ORDER  IS  PERMANENT  SORTED  BY  DEFINED  KEYS 
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MEMBER  IS  P 

INSERTION  IS  AUTOMATIC 
RETENTION  IS  FIXED; 

KEY  IS  ASCENDING  PNO  IN  P 

DUPLICATES  ARE  NOT  ALLOWED 
NULL  IS  NOT  ALLOWED; 

SET  SELECTION  IS  THRU  P-SET  SYSTEM. 

SET  NAME  IS  J-SPJ; 

OWNER  IS  J; 

ORDER  IS  PERMANENT  SORTED  BY  DEFINED  KEYS. 
MEMBER  IS  SPJ 

INSERTION  IS  AUTOMATIC 
RETENTION  IS  MANDATORY; 

KEY  IS  ASCENDING  SNO  IN  SPJ,  PNO  IN  SPJ 
DUPLICATES  ARE  NOT  ALLOWED 
NULL  IS  NOT  ALLOWED; 

SET  SELECTION  IS  THRU  J-SPJ  OWNER 

IDENTIFIED  BY  IDENTIFIER  JNO  IN  J 
EQUAL  TO  JNO  IN  SPJ. 


SET  NAME  IS  J-SET; 

OWNER  IS  SYSTEM; 

ORDER  IS  PERMANENT  SORTED  BY  DEFINED  KEYS. 
MEMBER  IS  J 

INSERTION  IS  AUTOMATIC 
RETENTION  IS  FIXED; 
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key  is  ascending  jno  in  j 

DUPLICATES  ARB  NOT  ALLOWED 
NULL  IS  NOT  ALLOWED; 

SET  SELECTION  IS  THRU  J-SET  SYSTEM. 


After  applying  the  rules  of  schema  translation,  we  define 
the  following  relational  schema 

S«(SNO,  SNAME,  STATUS,  CITY) 

P« ( PNO .  PHAME,  COLOR,  WEIGHT) 

J-(JNO,  JNAME,  CITY) 

SPJ-(SNO,  PNO.  JNO,  QTT). 

We  create  an  access  path  relation  as  follows: 


set 

owner 

member 

S-SPJ 

S 

SPJ 

J-SPJ 

J 

SPJ 

P-SPJ 

P 

SPJ 

S 

SYSTEM 

S 

p 

SYSTEM 

P 

J 

SYSTEM 

J 

6.3  Query  Translation 

In  our  database  communication  system,  each  database 
system  is  presented  to  a  user  with  a  global  relational 
schema.  A  query  for  data  access  or  update  is  specified  in 
terms  of  a  relational  calculus-like  qualification  over 
relations  with  a  target  list.  Codd's  data  sublanguage  ALPHA 
(DSL  ALPHA)  [CODD  72]  is  one  of  the  calculus-based  data 
sublanguages.  It  consists  simply  of  the  relational  calculus 
in  a  syntactic  form  which  more  closely  resembles  that  of  a 
programming  language.  In  practice,  the  syntax  would  have  to 
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be  compatible  with  that  of  the  host  language,  whatever  that 
was. 


Por  our  purpose,  we  shall  use  the  syntax  of  ALPHA  which 
is  expressed  in  [CODD  70].  We  restrict  ourselves  to 
consideration  of  its  major  features  only.  Assume  a  query  Q 
express  in  ALPHA  issued  at  site  S.  An  example  of  a  query 
expressed  in  ALPHA  is: 

EXAMPLE: 

Let  the  query  be  "Get  SNO  values  for  suppliers  who 
supply  a  LONDON  or  PARIS  project  with  a  red  part.” 

The  corresponding  ALPHA  statement  of  this  query  will  be 

RANGE  P  PX 
RANGE  J  JX 

GET  W  (SPJ.SNO)  :  3  PX  3  JX  ( PX . COLOR- ' RED ' 

A ( JX . Cl TT- ' LONDON 'VJX.CITY- ' PARI S ’ ) 

A  SPJ . PNO-PX . PNO  A  SPJ.JNO-JX.JNO) 

The  list  of  attributes  within  parenthesis  of  w  is  the 
target  list  which  specifies  the  attributes  to  retrieve. 
The  predicate  calculus  following  is  the 

qualification 

The  query  optimizer  in  the  query  processing  unit  of  the 
system  will  transform  the  query  into  a  sequence  of 
relational  algebra  operations.  This  sequence  of  relational 
algebra  operations  is  the  optimal  query  processing  strategy. 
Some  of  the  DBMSs  may  be  implemented  on  the  C0DAS7L  model. 
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In  order  to  execute  the  operations  against  the  local  DBMS, 
we  have  to  translate  a  relation  .  algebra  operation  against 
this  DBMS  to  a  program  of  CODASYL  DML  statements.  The  query 
translator  of  this  section  plays  the  role  of  this  process. 

CODASYL  Data  Manipulation  Language 

We  briefly  introduce  the  way  queries  are  specified  in 
CODASYL  DML.  In  the  CODASYL  model,  the  major  DML  statements 
are  the  following: 

FIND - locates  an  existing  record  occurrence  and 

establishes  it  as  the  current  of  run-unit. 

GET - retrieves  the  current  of  run-unit. 

MODIFY - updates  the  current  of  run-unit. 

CONNECT - inserts  the  current  of  run-unit  into  one  or  more 

set  occurrences. 

DISCONNECT — removes  the  current  of  run-unit  from  one  or  more 
set  ocurrences. 

ERASE - deletes  the  current  of  run-unit. 

STORE - creates  a  new  record  occurence  and  establishes 

it  as  the  current  of  run-unit. 

A  query  in  the  CODASYL  model  is  a  sequence  of  DML 
statements  which  are  embedded  within  a  program  as  a 
syntactic  extension  of  the  host  language.  The  function  of 
the  FIND  statement  is  to  locate  a  record  occurrence  in  the 
database  and  to  make  it  the  current  of  the  run-unit  (the 
most  recently  accessed  record  occurrence),  also  the  current 
of  the  appropriate  area  (logical  partitioning  of  the 


180 


database),  the  current  of  the  appropriate  record  type,  and 
the  current  of  all  sets  in  which  it  participates.  It  is 
logically  required  before  each  of  the  other  statements, 
except  STORE.  Note  that  the  FIND  statement  itself  does  not 
retrieve  any  data.  For  each  program  operating  under  its 
control,  the  DBMS  maintains  a  table  of  "currency  status 
indicators”  in  the  user  working  area  (UWA) .  These  indicators 
are  actually  database-key  values. 

The  current  of  the  run-unit,  whatever  its  type,  can  be 
brought  into  the  UWA  only  by  executing  GET.  The  modify 
statement  replaces  (portions  of  )the  current  of  the  run-unit 
with  values  taken  from  the  UWA.  The  STORE  statement  will 
store  newly  constructed  data  items  in  the  UWA  into  the 
database. 

In  the  distributed  DBMS  environment,  we  assume  each 
system  has  a  SEND  command  which  can  send  a  file  or  part  of 
the  UWA  from  one  system  to  another  system. 

To  update  a  database,  or  to  create  a  new  record,  it  is 
necessary  to  retrieve  the  data  occurrence  to  be  updated  or 
to  create  a  new  data  occurrence  in  the  UWA.  After  updating 
the  data  occurrence  in  the  UWA,  a  MODIFY  or  STORE  statement 
is  then  applied.  Therefore,  we  need  to  consider  only 
retrieve  operations. 

We  first  study  the  translation  procedures  for  two  unary 
algebra  operators:  project  and  select. 
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He  assume  R»{Aj  ,KX, . . .  ,A^  j  and  I»{X,  ,  ...,AK3.  PROJECT, 
denoted  by  TTy(R)#  is  to  retrieve  the  attributes  in  X  of 
each  record  in  R.  The  algorithm  PROJECT^  (R,X) ,  as  shown 
below,  is  to  translate  TTX (R)  into  a  DML  program. 

PROJECTOR,  X): 

IF  (  (t)-’R-set'  and  system' ) 

/*  description  record  */ 

THEN  NXT:  FIND  next  R  within  R-set; 

IF  end  of  set  GOTO  quit; 

GET  AjIN  R,  . . . . ,  Ak  IN  R? 

GOTO  NXT. 

ELSE  IF  (  (t )*' S-R'  and 

/*  connection  record  */ 

THEN  NXT:  FIND  next  R  within  S-R; 

IF  end  of  set  GOTO  quit; 

GET  A  |  in  R,...,  A in  R. 

GOTO  NXT. 

Let  Y"{A |  ,A^  ,...,  A^  }  SR.  SELECT,  denoted  by 
6^^<a,^R),  is  to  select  tuples  in  R  such  that  R.Aj-'c,  ',.... ,  R.A^ 
»’cK'.  The  algorithm  below  is  the  translation  algorithm. 

SELECT  ( R ,  Y ,  C|  ,  c^  , . . . ,  c^)  S 

IF  (R  is  a  connection  record) 

THEN  if  (some  A  £  is  and  attribute  of  S)  6  (S-R  is  a  set) 


THEN  BEGIN 
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MOVE  'c,'  to  A,  in  S; 
FIND  any  S; 

IF  S-R  empty  GOTO  quit; 
MOVE  ’ca*  to  Aa  in  R; 


MOVE  '  c^’  to  A^  in  R; 

FIND  any  R  within  S-R; 

IF  end  of  set  GOTO  quit; 

GET  R; 

END; 

ELSE  BEGIN 

/*  there  exist  some  S  s.t.  S-R  is  a  set  */ 
NXTs  FIND  next  S  within  S-set; 

IF  end  of  set  GOTO  quit; 

GET  A  in  S; 

MOVE  'c,'  to  A,  in  R; 

e 

e 

• 

MOVE  'cK'  to  Ak  in  R; 

FIND  any  R  within  S-R; 

GET  A |  m  R,  •••  f  A^  in  R ; 

IF  end  of  set  GOTO  NET; 

END; 


Next,  we  look  at  binary  relation  algebra  operations. 
Several  binary  relation  algebra  operations  (e.g.,  division, 
join,  semi join,  etc.)  which  reference  two  relations  may  be 
stored  at  the  same  system,  in  this  case,  data  must  be  sent 
from  one  system  to  another.  For  a  CODAS YL  DBTG  model  DBMS, 
there  are  three  strategies  for  sending  data  through  the 
network:  record-at-a-time  transmission,  set-at-a-time 
transmission  and  all-records-at-a-time  transmission.  In  a 
C0DAS7L  DBTG  model  DBMS,  record-at-a-time  and  set-at-a-time 
access  operators  are  supported  which  enable  one  node  to  send 
data  to  the  other  node  after  performing  such  an  operation. 
The  data  transmission  of  these  two  types  of  operations 
requires  considerable  communication  overhead.  In  a 
relational  DBMS,  all  records  required  by  another  system  are 
usually  transmitted  at  a  time. 

We  assume  in  this  thesis  that  the  transmission  mode  of 
any  system  will  be  all-records-at-a-time.  We  also  assume 
that  the  data  are  in  relational  table-like  form  after  being 
retrieved  from  local  DBMSs  and  temporarily  stored  in  the  uwa 
of  each  local  DBMS.  We  assume  each  DCS  has  the  ability  to 
execute  relational  algebra  operations.  This  assumption  will 
enable  easier  data  transmission  when  it  is  required.  It  also 
makes  the  translation  procedures  of  binary  relational 
algebra  operators  distinct  between  the  two  cases  in  which 
the  operands  of  the  operator  are  at  the  same  system  and 
where  they  are  not. 


184 


When  the  two  operands  of  the  operator  are  in  different 
systems,  we  always  retrieve  the  first  operands  and  store 
them  in  the  user  working  area  as  a  relation  and  then  send  it 
to  the  other  system  to  produce  the  final  result.  We  will 
first  consider  the  case  when  two  operands  are  in  different 
sites. 


Two  of  the  most  useful  distributed  binary  relational 
algebra  operators  for  distributed  query  processing  and  the 
execution  sequence  are  join  and  semijoin,  defined  below: 

Let  R,  and  R*  be  two  relation  schemas  at  different 
systems  S,  fc  S*  respectively  and  z«R(nR2  be  the  set  of 
attributes  in  R(  and  R2  •  Without  lost  of  generality,  we 
assume  X*{A(  , , ...,A|e'].  Let  T(  and  be  the  temporary 
relations. 

1.  JOIN 
DEFINITION: 

The  distributed  join  of  R,  and  Ra  over  X  is  a 

distributed  query  operator  which  executes  the  following 

sequence  of  operations: 

1.  retrieve  R^  from  as  T^  ; 

2.  send  T^  from  S2  to  S, ; 

3.  R  |X|  T,  at  site  S, . 

X 

The  query  translation  of  this  operator  is  as  follow: 
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IF  R|  and  Rz  are  both  relational  DBMSs 
THEN  execute  it  as  relational  operator; 

ELSE  IP  R(  is  relational  and  R^  is  C0DAS7L 
THEN 

Pjj(Ra,Ra)  at  Sz  as  T  ; 
send  T  from  Sx  to  S,  ; 

execute  R, |Z(  R,  as  relational  operator; 

X 

ELSE  IF  R  is  C0DAS7L 
1 

THEN 

retrieve  R^  from  S^as  a  elation  T; 
send  T  from  S^  to  S(; 

For  all  t€T 

SELECT^ ( R(  ,Rj-X, tf  ftA, . . . f  tK) ; 

This  operation  will  create  a  new  relation  T'  with  relation 
schema  R^  U  R^at  the  working  area  of  system  S,. 

2.  SEMI JOIN 
DEFINITION: 

The  distributed  semi  join  of  R(  and  Ra  over  X,  R(<|Z|  Ra 
,  execute  the  following  sequence  of  operations: 

1.  project  R | over  I  at  S(  and  form  a  result  T, ; 

2.  send  T(  to  Sa; 

3.  join  T j  with  R^  at  Sa  and  form  a  result  T^  ,  i.e. 
select  R 2,  based  on  Tj  at  ; 

4.  send  T^  to  S,; 

5.  join  R2  to  R  |  at  S  |  to  form  a  result  T^  at  S,  ,  i.e. 
select  R|  based  on  X  columns  of  R^. 


This  operation  can  be  executed  as  follows: 

IF  R |  and  R^  are  both  relational  systems 
THEN  execute  the  operator  as  relational  operator 
ELSE  IF  R,  is  relational  fc  R^  is  C0DAS7L 
THEN 

project  R(  on  X  at  S(  as  T,  ; 

send  T(  to  S^; 

for  each  (t(  ,t^, . . .  ,tK)  6T, 

SELECTji  ( R^  f  R^  ,  1 1 ,  •  •  •  *  t p ) 

and  form  result  as 
send  T^  to  S, ; 

join  R^to  R,  at  S(  as  relational 
operator; 

ELSE  IF  R(is  C0DAS7L  &  R^  is  relational 
THEN 

PROJECTOR,  ,X)  as  T, ; 
send  T ,  to  Sz ; 

join  T (  with  Ra  as  relational 
operator  to  get  T*; 
send  T^  to  S, ; 
for  t*(t,  , .  *  *  ft*, . . . , t^  )6T 
SELECTOR,  ,R,  , 
and  form  result  as  Tx; 

ELSE  IF  both  R  |  6  R^are  C0DAS7L 
THEN 


PROJECTOR,  ,X)  as  T.; 
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send  T, to  S2 ; 
for  each  t6  T, 

SELECT  (Ra,Ra,t) 
and  form  result  as 
send  Tx  to  S  t ; 

for  all  t»  (t,  , . . . , t^,  t,^  , . .  ,t^) 
Ti#  SELECT  (R, ,R, ,(t tK) ) 
form  a  result  Tj ; 


The  Union,  Intersection  and  Difference  of  two  relations 
require  two  operands  having  the  same  set  of  attributes.  If 
both  two  operands  are  stored  in  relational  systems,  then  we 
just  perform  the  relational  operation.  If  one  of  the  two 
operands  R  are  stored  in  CODASYL  systems,  we  first  use 
PROJECTOR, R)  to  retrieve  R  to  form  a  relation  and  then 
perform  the  relational  operation  afterward.  If  both 
operands  are  stored  in  CODASYL  systems,  then  we  must 
consider  two  cases.  If  the  results  of  the  operation  are 
temporarily  stored  in  the  UWA  for  use  by  later  operations, 
then  we  use  both  PROJECT^  (R,  ,R( )  and  PROJECT^fR^R^)  to 
retrieve  both  two  operands  as  two  relations  and  send  one 
relation  from  one  site  to  the  other  to  perform  the 
operation;  or  we  can  use  PROJECT  to  retrieve  one  operand  as 
a  relation  and  send  it  to  the  other  site  and  then  perform 
selections. 
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The  second  case  is  that  in  which  the  results  of  the 
operation  are  to  be  stored  in  one  site.  Zn  this  case,  we  use 
PROJECT  to  retrieve  one  operand  as  a  relation  and  send  it  to 
the  other  site  and  perforin  corresponding  CODASYL  DML 
statements. 

When  the  two  operands  of  a  binary  operation  are  at  the 
same  site  and  are  stored  in  a  CODASYL  system,  then  we  can 
perform  the  operation  in  the  same  sequence  we  discussed 
above.  However,  we  can  do  it  in  a  more  efficient  way  because 
it  is  not  necessary  to  fisrt  retrieve  one  operand  as  a 
relation  in  the  execution  of  this  operation.  We  can  use 
solely  CODASYL  DML  statements  to  perform  this  operation  and 
do  it  by  record-at-a-time  or  set-at-a-time.  In  some  cases, 
we  even  can  do  much  better  by  combining  a  sequence  of 
relational  opeations  which  reference  data  stored  at  the  same 
CODASYL  system  and  then  translating  them  to  a  sequence  of 
CODASYL  DML  statements. 

Let  us  look  at  two  examples:  We  assume  S,  P,  J,  SPJ  are 
as  in  example  1  and  they  are  stored  at  the  same  system. 

EXAMPLE  2  Perform  the  operation  J  |X|  SPJ  . 

JVO 

We  can  translate  this  operation  into  a  CODASYL  DML 

statements  as  follow: 

NET:  FIND  next  J  within  J-SBT; 

ZF  end  of  set  G 0*0  quit; 

get  SPJ  and  form  a  new  relation; 


GOTO  NXT 


EXAMPLE  3 

Let  the  subquery  expressed  in  ALPHA  referenced  to  this 
C0DAS7L  DBMS  be 

PROJECT  (JOIN  (SELECT  SPJ  WHERE  JNO-’Jl') 

AND 

(SELECT  P  WHERE  COLOR-'RED' ) 

OVER  PNO)  OVER  SNO  SNAME  STATUS 

The  interpretation  of  this  subquery  is  to  get  SNO, 
SNAME,  STATUS  values  for  suppliers  who  supply  project 

J1  with  a  red  part. 

• 

The  corresponding  C0DAS7L  DML  statement  for  this 
subquery  could  be 

MOVE  'Jl'  TO  JNO  IN  J; 

FIND  AN7  J? 

IF  J-SPJ  EMPT7  GOTO  QUIT? 

MOVE  BLANK  TO  TEMP- SNO; 

NXT.  FIND  NEXT  SPJ  WITHIN  J-SPJ; 

IF  end  of  set  GOTO  QUIT; 

GET  SPJ 

IF  SNO  IN  SPJ  ■  TEMP- SNO  GOTO  NXT; 

MOVE  SNO  IN  SPJ  TO  TEMP- SNO; 

FIND  OWNER  WITHIN  P-SPJ; 


GET  P; 
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IF  COLOR  IN  P  -'RED' 

THEN 

MOVE  TEMP-SNO  TO  SNO  IN  S; 

FIND  ANY  S; 

GET  SNAME  IN  S,  STATUS  IN  S ; 

ADD  TEMP-SNC,  SNAME f  STATUS 

VALUES  TO  RESULT  LIST? 

ELSE  MOVE  BLANK  TO  TEMP-SNO; 

GOTO  NET. 

From  these  two  examples,  we  learn  that  some 
optimization  can  be  done  on  the  translation  of  subquery 
referencing  to  a  CODASYL  DBMS  into  a  sequence  of  CODASYL  DML 
statements.  This  is  one  of  the  optimization  problems  of 
local  processing.  We  do  not  plan  to  discuss  it  further 
within  this  thesis. 

6.4  Conclusions 

In  this  chapter,  we  have  presented  detailed  schema 
translation  rules  for  translating  a  CODASYL  data  model  to  a 
relational  data  model,  and  algorithms  for  translating 
relational  ALPHA  query  into  CODASYL  DML  statements  whose 
associated  databases  schemata  have  themselves  been 
translated.  The  translation  algorithms  are  developed  for 
each  relational  atomic  operation.  The  translation  procedures 
are  based  on  the  relational  operations  sequence  provided  by 
the  query  optimizer  with  the  objective  of  minimizing 
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communication  complexity  and  are  done  one  relational  algebra 
operation  at  a  time.  The  translation  procedures  also  use  the 
information  of  the  access  path  relation. 

Some  optimization  for  the  local  query  processing  can  be 
done  by  translating  a  subquery  which  is  a  continuing 
subsequence  of  relational  operations  which  reference  data  at 
the  same  DBMS  into  a  sequence  of  CODASYL  DML  statements 
rather  than  translating  one  operation  at  a  time.  Because 
our  major  concern  is  the  communication  cost,  this  local 
processing  optimization  will  not  be  considered  in  this 
thesis. 
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Chapter  7 

Summary  and  Puture  Research 

This  thesis  addresses  the  problem  of  query  processing 
in  distributed  database  management  systems.  In  distributed 
homogenenous  relational  databases,  we  develop  a  mathematical 
model  and  algorithms  for  solving  a  query  by  a  sequence  of 
joins  and  semijoins  using  mixed  strategy.  We  extend  this 
model  to  distributed  heterogeneous  databases  and  only 
consider  problems  of  schema  translation  and  query 
translation  which  are  parts  of  the  query  processing  problem. 
Many  other  problems,  e.g.,  consistency,  redundancy, 
concurrent  control,  security,  etc.,  remain  to  be  done.  We 
will  here  briefly  summarize  our  research  effort  and  suggest 
the  potential  direction  of  future  research  for  each  chapter 
and  then  make  a  few  concluding  remarks. 

7.1  Summary  and  Future  Research 

In  chapter  1,  we  discussed  the  need  for  research  in 
query  processing  in  the  integration  of  distributed  database 
management  systems.  That  chapter  also  contains  a  goal  and  a 
road  map  for  the  entire  thesis. 

In  chapter  2,  we  first  reviewed  some  formal  definitions 
of  relational  terminologies  and  previous  studies  of  the 
query  processing  problem  in  distributed  relational 
databases.  Under  the  assumption  that  network  traffic 
constitutes  the  main  critical  factor,  we  have  developed  a 


mathematical  model  to  compute  the  minimum  communication  coat 
of  a  join-semi join  program  for  processing  a  given  equi-join 
query  .  Por  each  equi-join  query,  we  define  a  query 
processing  graph  for  a  given  query  from  which  the  set  of 
join-semi join  programs  that  solve  the  query  can  be 
characterized.  A  rule  for  estimating  the  size  of  the  derived 
relation  is  proposed.  The  parameters  for  estimating  the  size 
of  the  derived  relation  form  a  consistent  parameter  system, 
with  the  assumption  that  the  communication  cost  dominates 
and  that  the  cost  functions  are  linear  in  the  sizes  of  the 
data  to  be  transmitted,  the  distributed  query  processing 
problem  is  formulated  as  a  dynamic  network  problem.  A  slight 
change  of  the  model  made  by  associating  each  node  with  a 
local  processing  cost  will  extend  the  model  to  the  case 
where  local  computer  processing  time  and  transmission  delay 
through  the  network  are  comparable.  This  could  happen  for 
distributed  databases  in  a  local  area  network  environment, 
for  example. 

One  future  research  direction  is  to  extend  this  model 
to  cover  a  larger  class  of  queries, e. g. ,  the  class  of 
ineqality-join  queries,  queries  with  existential  quatifier, 
etc.,  by  providing  a  model  of  measuring  the  reducibilities 
and  estimating  the  size  of  the  derived  relations.  Another 
potential  research  direction  is  the  optimization  of  the 
local  processing  cost.  In  the  case  where  both  local 
processing  costs  and  communication  costs  are  important,  the 
optimization  of  data  retrieval  when  local  computer 
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processing  is  used  becomes  essential.  The  access  paths  to 
the  stored  data,  the  implementation  algorithms  of  relational 
algebra  operations,  e.g.,  join,  project,  selection,  etc., 
and  the  order  of  relational  algebra  operations  are  also 
critical  factors  in  query  optimization.  It  would  be  very 
nice  to  provide  a  model  to  consider  all  those  factors  as 
well  as. communication  costs. 

In  chapter  3,  we  studied  the  computational  complexity 
of  the  distributed  query  processng  problem.  For  a  simpler 
case  where  all  semijoin  reducibilities  are  zero  and  join 
reducibilities  do  not  affected  by  join  operation,  we  have 
shown  that  under  three  different  objective  functions  the 
problems  of  finding  a  routing  strategy  of  required  data  to 
the  site  where  a  query  is  initiated  are  NP-complete.  This 
gives  us  an  indication  that  the  distributed  query  processing 
problem  is  a  hard  problem.  In  fact,  we  can  see  from  the 
nature  of  the  dynamic  network  problem  how  to  identify  the 
reason  for  the  difficulty  of  this  problem.  Future  research 
direction  will  be  the  complexity  of  the  query  processing 
problem  in  the  model  of  chapter  2. 

One  of  the  future  research  directions  is  to  identify  a 
set  of  conditions  so  that  under  those  assumptions,  the 
problem  will  be  solvable  using  a  polynomial  time  algorithm. 
Another  future  research  direction  is  to  find  the 
computational  complexity  of  an  approximation  problem  for  the 
distributed  query  processing  problem.  We  conjecture  that  the 
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'approximation  problems  are  also  very  hard  problems. 

In  chapter  4,  we  analyzed  the  difficult  nature  of  the 
query  processing  problem  and  provided  an  analytical  basis 
for  heuristic  algorithms.  We  first  considered  the  simpler 
case  of  the  problem  where  all  possible  semi  joins  are 
performed  first,  i.e.  all  semi join  reducibilities  become 
zero.  We  provided  several  heuristic  algorithms  for  this 
problem.  Each  of  the  algorithms  has  two  stages.  The  first 
stage  is  to  find  a  feasible  processing  strategy.  The  second 
stage  is  the  improvement  stage  where  interchange  procedures 
are  used.  We  then  extend  those  heuristic  algorithms  to  the 
general  query  processing  problem  by  including  semijoin 
operations  into  the  sequence  of  join  operations. 

A  future  research  direction  is  the  study  of  the 
analytic  behavior  of  those  heuristic  algorithms.  Although 
some  analytic  results  of  worst  case  and  average  case 
analysis  are  difficult  to  obtain,  some  computational 
experiments  may  be  conducted  to  get  a  feeling  for  the 
average  performance  of  the  algorithms.  Developing  more 
efficient  and  general  solution  procedures  for  general  query 
processing  problems  is  also  a  future  research  area. 

In  chapter  5,  we  developed  a  method  for  query 
processing  in  a  distributed  heterogeneous  database 
management  systems  environment.  A  heterogeneous  database 
communication  system  is  proposed  to  integrate  heterogeneous 
database  management  systems  to  combine  and  share 
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information.  The  architecture  of  a  heterogeneous  database 
communication  system  is  described  and  several  components  are 
identified.  The  use  of  a  database  communication  system  for 
heterogeneous  DBMSs  makes  the  overall  system  transparent  to 
users  from  an  operational  point  of  view.  In  this  chapter,  we 
are  concerned  only  with  query  processing  in  a  heterogeneous 
environment.  Other  problems  of  a  database  communication 
system  such  as  concurrency  control,  updating,  redundancy, 
security,  etc.,  are  subjects  for  future  research. 

In  chapter  6,  we  presented  detailed  schema  translation 
rules  for  translating  a  schema  of  a  CODASYL  data  model  to  a 
relational  schema.  We  also  presented  algorithms  for 
translating  relational  algebra  operations  into  CODASYL  DML 
statements  whose  associated  database  schemata  have 
themselves  been  translated.  Translation  algorithms  are 
developed  for  each  relational  atomic  operation.  Relational, 
network,  and  hierarchical  models  are  the  major  three  data 
models  that  have  been  used  in  commercial  database  management 
systems.  The  schema  translator  and  query  translator  for  the 
case  in  which  the  underlying  data  model  is  relational  are 
rather  easy  to  develop.  For  the  case  where  the  underlying 
data  model  is  hierarchical,  the  translation  procedures  are 
similar  to  the  case  of  the  network  model. 

Some  optimization  for  the  local  query  processing  can  be 
made  by  translating  a  subquery,  which  is  a  continuing 
subsequence  of  relational  operations  which  reference  data  at 


197 


the  sane  DBMS  into  a  sequence  of  C0DAS7L  DML  statements 
rather  than  translating  one  operation  at  a  time.  Zn  the  case 
where  the  local  processing  cost  is  comparable  to  the 
communication  cost,  this  optimization  is  essential  and  must 
be  taken  into  account.  This  is  a  rich  subject  for  future 
research. 

7.2  Conclusions 

The  main  objective  of  this  thesis  has  been  to  study 
query  processing  in  a  distributed  database  environment. 
Different  data  retrieval  strategies  generally  lead  to 
substantially  different  system  performances  in  terms  of 
response  time,  computer  utilization  and  network  traffic.  In 
practice,  communication  cost  constitutes  a  major  factor. 
Past  experience  showed  that  deciding  a  solution  strategy  for 
processing  a  given  query  is  a  very  complicated  problem.  The 
mathematical  formulation  of  the  distributed  query  processing 
of  this  thesis  provides  a  formal  model  of  this  problem,  and 
we  showed  theoretically  that  this  problem  is  indeed  a  very 
difficult  problem.  The  heuristic  algorithms  proposed  in  this 
thesis,  based  on  the  analysis  of  the  problem,  should  help  in 
finding  an  optimization  strategy  for  a  distributed  query 
processing  environment. 

The  database  communication  system  approach  in  chapter  5 
provides  a  method  of  integrating  heterogeneous  database 
management  systems.  Zt  leaves  several  problems  for  future 


research  activities.  Detailed  study  and  specification  of 
each  component  of  the  system  are  desired.  Future  research 
efforts  should  also  be  oriented  toward  prototype  system 
implementation. 

Finally,  it  is  hoped  that  more  applications  of  the 
works  in  this  thesis  will  occur  in  the  future. 
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